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Abstract. Sumner's universal tournament conjecture states that any tournament on 2n— 2 
vertices contains any directed tree on n vertices. In this paper we prove that this conjecture 
holds for all sufficiently large n. The proof makes extensive use of results and ideas from a 
recent paper by the same authors, in which an approximate version of the conjecture was 
proved. 

1. Introduction 

1.1. Introduction. A tournament is an orientation of a complete graph. Obviously one 
cannot guarantee any substructures which contain a cycle within an arbitrary tournament. 
On the other hand, Sumner's universal tournament conjecture states that one can find any 
directed tree T within an arbitrary tournament G, even if the order of T is rather large 
compared to that of G. More precisely, the conjecture states that any tournament on 2n — 2 
vertices contains any directed tree on n vertices. Many partial results towards this conjecture 
(made in 1971) have been proved - some of them are described below. Here we prove this 
conjecture for all large n. 

Theorem 1.1. There exists uq such that the following holds. Let T be a directed tree on 
n > uq vertices, and G a tournament on 2n — 2 vertices. Then G contains a copy ofT. 

To see that the bound is best possible, let T be a star with all edges directed inwards, 
and let G be a regular tournament on 2n — 3 vertices. Then every vertex of G has n — 2 
inneighbours and n — 2 outneighbours, and so G does not contain a copy of T, whose central 
vertex has n — 1 inneighbours. There are also 'near-extremal' examples which have a different 
structure to the one given above: let T be obtained from a directed path on £ > 1 vertices by 
adding y := (n — £)/2 outneighbours to the terminal vertex of the path and y inneighbours 
to the initial vertex of the path. Let G consist of regular tournaments Y and Z, each on 
2y — 1 vertices, together with an arbitrary tournament X on I — 1 vertices so that all edges 
are oriented from Z to X, from X to Y and from Z to Y. Then \G\ = 2n — I — 3 as well 
as \T\ = n, and it is easy to see that G does not contain T. These examples will play a 
significant role in the proof (see Section [l.2p . 

In [10], we used a randomised embedding algorithm to prove an approximate version of 
Sumner's universal tournament conjecture, and also a stronger result for directed trees of 
bounded degree. Both of these results will be important tools in this paper. 

Theorem 1.2 (|10|. Theorem 1.4). Let a > 0. Then the following properties hold. 

(i) There exists no such that for any n > no, any tournament G on 2(1 + a)n vertices 
contains any directed tree T on n vertices. 

(ii) Let A be any positive integer. Then there exists hq such that for any n > no, any 
tournament G on (1 + oc)n vertices contains any directed tree T on n vertices with 
A(T) < A. 
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Let f(n) denote the smallest integer such that any tournament on f(n) vertices contains 
any directed tree on n vertices. So Sumner's conjecture states that f(n) = 2n—2. Chung (see 
|16j ) observed that /(n) < n 1+ °^ l \ and Wormald [16] improved this to f(n) < 0(n log n). 
The first linear bound on f{n) was established by Haggkvist and Thomason [3]. Havet [5] 
then showed that f{n) < 38n/5, and later Havet and Thomasse [7] used their notion of 
median orders to improve this to f{n) < 7n/2. Finally El Sahili used the same notion to 
prove the best known bound for general n, namely that f{n) = 3ra — 3. We shall make 
extensive use of this result in this paper (actually, any linear bound would suffice for our 
purposes; the factor of 3 is not essential.) 

Theorem 1.3 (El Sahili [3]). Let T be a directed tree on n vertices, and let G be a tournament 
on 3n — 3 vertices. Then G contains a copy ofT. 

Sumner's conjecture is also known to hold for special classes of trees (see e.g. [H]). In 
particular, Havet and Thomasse [7] proved it for 'outbranchings', again using median orders. 
Here an outbranching is a directed tree T in which we may choose a root vertex t £ T so that 
for any vertex t' G T, the path between t and t! in T is directed from t to t' . (Outbranchings 
are also known as arborescences.) 

Theorem 1.4 (Havet and Thomasse [7]). Let T be an outbranching on n vertices, and let 
G be a tournament on 2n — 2 vertices. Then G contains a copy ofT. 

For many types of trees, Sumner's conjecture holds with room to spare. A classical result 
of this type is Redei's theorem. 

Theorem 1.5 (Redei |T3J). Any tournament contains a spanning directed path. 

This was generalised considerably by Thomason [15] who showed that whenever n is 
sufficiently large, every tournament on n vertices contains every orientation of the path on 
n vertices (this was a conjecture of Rosenfeld). Havet and Thomasse [8 J proved that this 
even holds for all n / 3,5, 7. They also proposed the following generalisation of Sumner's 
conjecture (see [6]): Let T be a directed tree on n vertices with k leaves. Then every 
tournament on n + k — 1 vertices contains a copy of T. Some special cases are known (see 
e.g. [2]). It would be interesting to know whether our methods can be used to prove this 
conjecture. 

As illustrated in the next section, our proof relies on all of the above theorems (i.e. The- 
orems ll.2Hl.5j) . as well as a directed version of Szemeredi's regularity lemma and several 
structural results proved in [lOj. 

1.2. Outline of the proof. In Section [21 we shall introduce some notation, before intro- 
ducing some key ideas and lemmas. In particular we shall define the core tree Ta of a tree 
T. This is a subtree of T consisting of all the 'central' vertices of T, which has the impor- 
tant property that every component of T — Ta is small. This is useful for the problem of 
embedding T in a tournament G, as we may first embed Ta and then proceed to embed 
the components of T — Ta one by one, using the fact that each such component is small. 
We also introduce the notion of an 'almost-regular' tournament G, which is a tournament 
in which every vertex has in- and outdegree approximately equal to |G|/2. Section [2] also 
contains three auxiliary lemmas for embedding a directed tree T in a tournament G which 
are derived from Theorems 1.2 and 1.3 and which we shall use extensively in later sections: 
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• Lemma 12.51 is designed to embed a directed tree T which is similar to an outstar, in 
the sense that T contains a vertex t with no inneighbours such that every component 
of T — t is small. 

• In Lemma [2 .6( we consider a subtree T c of T with the property that every component 
of T — T c is small, showing that a suitable embedding of T c in G can be extended to 
an embedding of T in G. 

• In Lemma 12.71 we consider the case where the vertices of G can be partitioned into 
disjoint sets Y and Z such that almost all edges between Y and Z are directed 
the same way. Here we show that if the vertices of T are partitioned appropriately 
between forests F~ and F + , then to be able to embed T in G it is sufficient to embed 
the largest component of F + within Y. 

We begin the proof of Theorem 11.11 in Section El by proving the case where |Ta| = 1 
(Lemma 13. ip . Note that the extremal case when T is a star is covered by this case. To 
do this, we first embed the single vertex of T\ to a vertex of G with appropriate in- and 
outdegree. We then use Lemma [231 Lemma l2,6l and Theorem 11.41 to embed the components 
of T — T\ appropriately among the remaining vertices of G to obtain a copy of T in G. 

Then in Section |4] we introduce the digraph regularity lemma, which yields a partition 
of the vertex set of G into clusters so that the edges between pairs of clusters of G form 
quasi-random bipartite subgraphs. We use the regularity lemma to prove 

• Lemma 14.61 which states that Theorem 11.11 holds in the case where G is almost- 
regular and Ta is small enough to be embedded within a single cluster of G. 

To prove this, we first select an appropriate cluster or pair of clusters of G in which to 
embed Ta, and then use Lemma 12.61 to extend this embedding of Ta to an embedding of T 
in G. We also prove that if we additionally assume that |Ta| > 2 then the result holds with 
room to spare, i.e. we can allow G to be of order (2 — a)n, where a is small. 

Next, in Section [5] we consider the case when the tournament G is a 'robust outexpander'. 
The latter implies that every set S of reasonable size has a large outneighbourhood. A key 
lemma in |10j showed that if G is a robust outexpander tournament on at least (2 + a)n 
vertices with large minimum semidegree, then G contains any directed tree T on n vertices. 
However, the an error term was only required in the case where Ta is small. In Section [5] 
we modify the argument from |10] to prove 

• Lemma [5.3l which states that if Ta is large, then any robust outexpander tournament 
on at least (2 — a)n vertices with large minimum semidegree contains a copy of T. 

(The proof relies on further results from |10].) It is easy to see that any almost-regular 
tournament is a robust outexpander tournament. So we can combine Lemmas 14.61 and 15.31 
to deduce 

• Lemma l5.8( which states that Theorem 1 1 . 1 1 holds with a little room to spare if G is 
a large almost-regular tournament and |Ta| > 2. 

We also prepare the ground for the proof of Theorem 1 1.1 1 by modifying an algorithm from [10] 
to prove Lemma 15.21 This states that any tournament G may be split into disjoint sub- 
tournaments, each of which is either small or a robust outexpander with large minimum 
semidegree. This will allow us to apply our results on robust outexpander tournaments to 
(subtournaments of) general tournaments G. 

In Section [6] we prove Lemma 16.11 which states that Theorem 11.11 holds for all directed 
trees T for which Ta is small. In particular, the 'near extremal' construction described 
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in the introduction is dealt with in this part of the proof. Lemma 16.11 is proved in four 
steps. Firstly, in Lemma 16.21 we show that we may assume the tournament G contains two 
almost-regular subtournaments on vertex sets Y and Z which between them contain almost 
all of the vertices of G. Using this structural information, we show in Lemmas 16.31 and 16.41 
that we may assume that Ta is a short directed path and that most of the remainder of 
T is attached to the endvertices of this path. (Lemma 15.81 is used as a tool here: we can 
apply it to embed a suitable subforest of T into Y or Z, and afterwards use Lemma 12.71 to 
embed the remainder of T.) We then consider the case |Ta| = 2 separately, proving that 
Theorem 11.11 holds for such T. This allows us to assume for the proof of Lemma 16.11 that 
|Ta| > 3. Since Ta is a directed path, we can use Redei's theorem to embed Ta within a set 
W of |Ta| vertices which have high in- and outdegree, and then apply Lemmas 12.51 and 12.61 
to complete the embedding again. 

Finally, in Section [7] we complete the proof of Theorem ll.il By Lemma [6.1 1 we may assume 
for this that Ta is large. None of the extremal or near-extremal cases satisfy this condition, 
so we will always have a little room to spare in our calculations in this part of the proof. 
We proceed by using Lemma 1531 to split the tournament G into disjoint robust outexpander 
subtournaments of large minimum semidegree. If there is just one such subtournament 
then this subtournament contains a copy of T by Lemma 15.31 By using Lemma 12.71 we 
prove Lemma 17.21 which shows that if there are two such subtournaments then these must 
also together contain a copy of T. We may therefore assume in the proof of Theorem 11.11 
that there are at least three such subtournaments of G. In this case we use Lemma 15.31 
Theorem 11.21 and Theorem 11.31 to embed T into these subtournaments. 

2. Definitions and basic tools 

2.1. Notation. For a graph G, we write V(G) and E{G) to denote the vertex set and 
edge set of G respectively. Then \G\ := \V(G)\ denotes the number of vertices of G, and 
e(G) := l-E^G)! is the number of edges of G. We shall sometimes write v G G to mean 
v G V(G). A tree is a connected graph which does not contain any cycles, and we say that 
a vertex of a tree is a leaf if it has degree one. 

A directed graph G, or digraph, consists of a vertex set V(G) and an edge set E{G), 
where each edge e G E is an ordered pair (u, v) of vertices of G. For vertices u, v G V(G) 
we write u — > v or v u to denote that (u, v) G E(G). If u — > v then we say that v is an 
outneighbour of u, that u is an inneighbour of v, and that the edge (u, v) is directed from 
u to v. Sometimes we shall use the term neighbour of v to mean a vertex which is either 
an inneighbour or an outneighbour of v. For any vertex v G G, we denote the set of all 
outneighbours of v by Nq(v), or simply N + (v) when G is clear from the context. Similarly 
we write Nq(v) or N~(v) to denote the set of all inneighbours of v. Then the outdegree of v, 
denoted d^v), is defined by d^v) := \Nq{v)\. Similarly the indegree of v, denoted d^v), 
is defined by d^v) := \Nq(v)\. Again we may write d + (v) or d~(v) when G is clear from 
the context. We define the minimum outdegree of G, denoted 5 + (G), to be the minimum 
of d + (v) taken over all vertices v G G, and the minimum indegree, denoted 5~(G), to be 
the minimum of d~(v) taken over all vertices v G G. Then the minimum semidegree of G, 
denoted 6°(G), is the minimum of 5~(G) and J + (G). We write G[U — > V] to denote the 
bipartite subgraph of G formed by edges directed from U to V. 

We say that a directed graph G is an oriented graph if for any u,v G G at most one ofu—>v 
and u v holds. So an oriented graph may be obtained by assigning a direction to each 
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edge of an undirected graph. We call this undirected graph the underlying graph, and denote 
it by G un der- An oriented graph is a tournament if for any distinct u, v G V(G) precisely one 
of u — )■ v and u v holds. Equivalently, the underlying graph of a tournament is a complete 
graph. A directed tree is an oriented graph T for which the underlying graph T un( ier is a 
tree. The maximum degree of T, denoted A(T), is defined to be equal to A(T nra ^ er ). A tree 
or directed tree T may be rooted by identifying a specific vertex r as the root of T. 

Let T be a directed tree, and let x be a vertex of T. Then for any edge e G £7(T) incident to 
x, the weight of e at x, denoted w e (x), is the number of vertices y of T for which e (ignoring 
the orientation) is the first edge of the path in T un d er from x to y. We say that a component 
of T — x is an incomponent of x if the unique edge between x and this component is directed 
towards x, and an outcomponent of x if this edge is directed away from x. The inweight of x, 
denoted w~(x), is then the number of vertices in incomponents of x, and the outweight of x, 
denoted u> + (x), is the number of vertices in outcomponents of x. Equivalently, the inweight 
of x is the sum of w e (x) taken over all edges e incident to x which are directed towards x, 
and the outweight can be defined similarly. 

In the same way we define incomponents and outcomponents for a subtree T c of T. Indeed, 
for any component T' of T — T c there is precisely one edge between T' and T c . If this edge 
is directed towards a vertex of T' then we say that T' is an outcomponent of T c , whereas if 
this edge is directed towards T c we say that T' is an incomponent of T c . As when T c is a 
single vertex we define the inweight of T c , denoted w~(T c ), to be the number of vertices in 
incomponents of T c , and the outweight ofT c , denoted u> + (T c ), to be the number of vertices 
in outcomponents of T c . Again these inweights and outweights can equivalently be defined 
as the sum of the weights of the appropriate edges of T. 

Throughout this paper we shall write x <C y to indicate that for any y > there exists 
xq > such that for any < x < xo the subsequent statements hold. Such statements with 
more variables are defined similarly. 

2.2. The core tree. Let T be a tree on n vertices, and let A > 2 be fixed. Then we say 
that a vertex x of T is A-core if every edge e incident to x has w e {x) < (1 — 1/A)n. We 
call the subgraph of T induced by A-core vertices of T the core tree ofT with parameter A, 
and denote it by Ta- With this definition, for any tree T, the core tree Ta is the same as 
the A-heart of T considered by Haggkvist and Thomason in [4]. The following proposition 
from pLOj gives some important properties of the core tree (these properties are also stated 
in@l). 

Proposition 2.1 (|10|. Proposition 4.2). Let T be a tree on n vertices and let A > 2. Then: 

(i) Ta is a tree containing at least one vertex. 

(ii) w e (x) > n/A if e = xy is an edge ofT&. 

(iii) A(T A ) < A. 

(iv) Every component subtree T' of T — Ta has \T'\ < n/A. 

Note that Ta is an undirected tree obtained from an undirected tree T. However we will 
frequently refer to the core tree of a directed tree T; this means the directed tree formed 
by taking the core tree Ta of the underlying graph T un d er (an undirected tree) of T and 
directing each edge of Ta as it is directed in T. 

The following proposition is needed in the proof of Lemma 12.31 Essentially the latter 
states that if trees T 1 and T 2 almost partition a tree T, then the core tree Ta is not much 
larger than U . 
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Proposition 2.2. Let T be a tree on n vertices, let x be a leaf of T , and let A > 2. Then 
\{T-x) A \ > |T A |-1. 

Proof. Let y be a vertex of Ta — (T — x) A , and let z be an arbitrary vertex of (T — x) A . 
Then for some edge e incident to y we have w e (y) > (1 — 1/A)(n — 1) in T — x. Since 
by Proposition I2.1f iv) the component of (T — x) — (T — x) A containing y contains at most 
(n— 1) /A vertices, this edge must in fact be the first edge of the path in T from y to z. If e is 
also the first edge of the path in T from y to x then we have w e (y) > (1 — 1/A)(n — 1) + 1 > 
(1 — 1/A)n in T, and so y G' Ta, giving a contradiction. So y must lie on the path in T from 
x to z. Since y G Ta we must have w e {y) < (1 — 1/A)n in T, and so in T we have 

(1 - i)n - 1 < (1 - i)(n - 1) < ™ e (y) < (1 - 

Clearly this can hold for at most one vertex y on the path from x to z. So |Ta — (T— x) A \ < 1, 
as desired. □ 



Lemma 2.3. Lei T be a tree on n vertices, let A > 2 and Zei 7, a > 0. Zei T 1 and T 2 

be subtrees of T such that IT 1 U T 2 \ > (1 — "y)n. Suppose also that |T A |, |T A | < an. Then 
\T A \ < 7n + 2an + 2n/A. 

Proof. Arbitrarily choose vertices x\ G T A and X2 G T A , and let P be the path from x\ to 
X2 (so P is also a subtree of T). Then let T* := T 1 UPUT 2 , so |T*| > (1 — 7)71. Furthermore, 
T* can be formed from T by repeated leaf-deletions. So by Proposition 12.21 we must have 
\T\ - \T*\ > |T A | - \T A \, and so 

(1) \T A \ < \T\ - \T*\ + |T A | < 7 n - |P - (T 1 U T 2 )| + \T A \. 

Let T* := T A U P U T A . We claim that T A C T*. Indeed, suppose for a contradiction 
that there exists a vertex y G T A — T*. Since T* is a subtree of T, every vertex of T* lies 
in the same component C of T* — y. Note that T* — C is a tree. Now, T A and T A are 
subtrees of C, so by Proposition 12. lf iv) T* — C contains at most |T 1 |/A vertices of T 1 and 
at most |T 2 |/A vertices of T 2 . Let e be the edge of T* between y and C. Then since y G T A , 
w e (y) < (1 - 1/A)|T*| in T*. So at least |T*|/A vertices of T* lie in components of T* - y 
other than C. As every vertex of P lies in C, either at least |T 1 |/A vertices of T 1 lie in 
components of T* — y other than C, or at least |T 2 |/A vertices of T 2 lie in components of 
T* — y other than C. In the former case this implies that T* — C contains more than IT 1 |/A 
vertices of T 1 , and in the latter case this implies that T* — C contains more than |T 2 |/A 
vertices of T 2 . In either case this yields a contradiction. 

Now, |T*| < 2cm + |P - (T A U T A )|. Since (P n T 1 ) — T A is contained within a single 
component of T 1 - T A , |(P n T 1 ) - T A | < (T^/A, by Proposition O^iv) . Similarly |(P n 
T 2 )-Tl\ < \T 2 \/A. So 

\T A \ < \T*\ <2an + (|r 1 | + |T 2 |)/A + |P-(T 1 ur 2 )|. 

So by © 

|T A | < 7n + ((T 1 ! + |T 2 |)/A + 2an < jn + 2n/A + 2an. 



□ 
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2.3. Almost-regular tournaments. In a regular directed graph G, every vertex v has 
d + (v) = d~(v) = e(G)/\G\. We say that a directed graph G is 7 -almost-regular if every 
vertex v G G has d + (v), d~(v) > (1 — 7)e(G)/|G|. In particular, if G is a tournament then G 
is 7-almost-regular if and only if every vertex v E G has d + (v),d~(v) > (1 — 7)(|G| — l)/2. 
The next proposition shows that for a large tournament G only one of these two bounds is 
needed to ensure that G contains an almost-spanning almost-regular tournament. 

Proposition 2.4. Suppose that 1/n < a < 7 < 1. Let G be a tournament on n vertices 
in which at least one of the following holds: 

(i) d + (v) > (1 — a)(n — l)/2 for every v € G, 

(ii) d~(v) > (1 — a)(n — l)/2 /or every v £ G, 

(iii) d + (v) < (1 + a)(n — l)/2 /or every v € G, 

(iv) d~(v) < (1 + a)(n — l)/2 /or every v £ G. 

Then G contains a 7- almost-regular subtournament G' on at least (1 — 7)n vertices. 

Proof. We shall prove (i); then (ii), (iii) and (iv) follow immediately. Suppose that G has 
at least ^fan vertices with d + {v) > (1 + y/bx){n — l)/2. Then 

o) = = E d » >(!"«) (") + V5n • V^(n - l)/2 = (" 

giving a contradiction. So there are at most \J~an vertices of G with d + (v) > (1 + y / a)(n — 
l)/2. Delete all of these vertices of G, and let G' be the obtained subtournament. Then 
n — \J~an < \G'\ < n. Also, every vertex of G 1 has 

+ (l~«)(n-l) r (1- 7 )(|G'|-1) 
> Van > 



and 



, r- (l + Vb7)(n-l) (1-7)(|G'|-1) 



2 ~ 2 

So G" is a 7-almost-regular tournament on at least (1 — 7)71 vertices, as desired. □ 



2.4. Some embedding results. The following three lemmas will be the main tools we 
shall use to embed directed trees in tournaments. We use Theorem 11.31 in the proofs of all 
three lemmas, although the factor of 3 in Theorem 11.31 is not critical to our proof; any linear 
bound would suffice. For the proof of Lemma 12.71 we also require the use of Theorem 11.21 

Lemma 2.5. Let T be a directed tree on n vertices, rooted at t, such that t has no inneigh- 
bours in T , and every component ofT — t contains at most d vertices. Let G be a tournament 
whose vertex set is partitioned into three sets, {v},N and X, where \N\ > n — 1, every vertex 
of N is an outneighbour of v, and at least 3d vertices of N each have at least 6d inneighbours 
in X and at least 6d outneighbours in X . Then T can be embedded in G in such a way that 
t is embedded to v and at most Ad vertices of X are occupied by this embedding. 

Proof. Let N 1 C N consist of all vertices of N with at least 60! inneighbours in X and at 
least 60! outneighbours in X. Then \N'\ > 3d. We begin by embedding t to the vertex v. 
Now let Ii, . . . ,T r be the components of T — t, in order of decreasing order. For each i, let 
ti be the single vertex of Tj which is an outneighbour of t. Then we shall embed T%, . . . , T r 
in turn in N L) X, with each ti embedded in iV and each Tj embedded in the vertices not 
occupied by the embeddings of T±, . . . , Ti-i- This will give an embedding of T in G. So 
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suppose that we have embedded T±, . . . , Tj_i in this manner, and we now wish to embed 
Tj. Then at most n — 1 vertices of T have been embedded. At least one of these vertices 
(namely t) was not embedded in N, so at least one vertex of N must be unoccupied. 

Suppose that N' contains at least one unoccupied vertex Vi, and also that fewer than 3d 
vertices of X have been occupied. Then Vi has at least 3d unoccupied inneighbours in X and 
at least 3d unoccupied outneighbours in X. Embed tj to Vi. We then proceed through the 
outcomponents of tj in Tj in turn. Suppose that when we come to embed an outcomponent 
of ti we have previously embedded m vertices of Tj. Then the current outcomponent has 
order at most d — m. Also, Vi has at least 3d — m> 3{d — m) outneighbours in X which have 
not yet been occupied, so by Theorem 11.31 we may embed this outcomponent amongst the 
outneighbours of Vi in X. Similarly we may embed the incomponents of tj in turn amongst 
the inneighbours of V{ in X, and so we obtain an embedding of Tj in the unoccupied vertices 
of G. Note that all vertices of Tj apart from tj are embedded in X. 

Now suppose instead that every vertex of N' has been occupied, but still that fewer than 
3d vertices of X have been occupied. Then at least one of the Tj with j < i must have had 
\Tj\ = 1, and so Tj consists of one single vertex, namely tj. We may therefore embed tj to 
any unoccupied vertex of N (recall that there is at least one such vertex). 

Finally, suppose that at least 3d vertices of X have been occupied. Then at least 3d + 1 
vertices of T have been embedded outside N, and so N contains at least n— 1 — (n— (3d+l)) = 
3d unoccupied vertices. Since |Tj| < d, by Theorem 11.31 we may embed Tj among these 
unoccupied vertices. 

By embedding each Tj in this fashion we obtain an embedding of T in G with t embedded 
to v . Furthermore, the only vertices embedded in X are those in some Tj such that when 
we came to embed Tj, N' contained at least one unoccupied vertex Vi, and fewer than 3d 
vertices of X had been occupied. The embedding of Tj occupied at most another d vertices 
of X, and so at most 4d vertices of X can have been occupied in total. □ 

Lemma 2.6. (a) Let T be a directed tree, and let T c be a subtree of T such that every 
component of T — T c contains at most d vertices. Let G be a tournament whose 
vertices are partitioned into two sets S and N such that for every vertex v G S we 
have 

(i) \N+(v) n N\ > |T - T c | + 2d, and 

(ii) \N-(v) n N\ > |T - T c | + 2d. 

Then any embedding ofT c in G[S] can be extended to an embedding of T in G. 

(b) Suppose that in addition to the above assumptions we choose a set N' C N and an 
integer r < \T — T c \, so that every vertex v G S satisfies 

(hi) \N + (v) n N'\ > r + 2d, and 
(iv) \N (v)DN'\ >r + 2d. 

Then any embedding of T c in G[S] can be extended to an embedding of T in G such 
that at least r vertices of T are embedded in N' . 

(c) Suppose that no edges ofT are directed from T c to T — T c . Then conditions (i) and 
(Hi) may be dropped without affecting the validity of the above result. Likewise if no 
edges of T are directed from T — T c to T c , then the above results hold even without 
conditions (ii) and (iv). 

Proof. Let n := |T|. We shall prove (b) and (c); for (a), apply (b) with r := |T — T c | 
and N' := N. Let T±,...,T q be the components of T — T c , so |Tj| < d for each i. Suppose 
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now that we have successfully extended the embedding of T c in G[S] to an embedding of 
T c UTiU- • -UT s _i in G. We shall demonstrate how to extend this embedding to an embedding 
of T c UTiU- • -UT S in G. Indeed, there is precisely one edge between T c and T s . Let t 6 T c and 
t s S T s be the endvertices of this edge, and let v be the vertex in S to which t is embedded. 

Suppose that t s is an outneighbour of t. By (i), v has at least \T — T c | + 2d outneighbours 
in N. At most |Ti| + • • • + |T s _i| of these outneighbours are occupied by the embedding of 
T c U T\ U • • • U T s _i, and so v has at least \T S \ + 2d > 3\T S \ outneighbours in N which are not 
occupied by this embedding. Now, by (hi), v has at least r + 2d outneighbours in N' . If at 
most r — \T S \ of these outneighbours are occupied by the embedding of T c U T\ U • • • U T s _i, 
then by Theorem 11.31 we may embed T s amongst the at least 2d + |T S | > 3|T S | unoccupied 
outneighbours of v in N'. If instead r — k of these outneighbours are occupied, for some 
1 < k < \T S \ — 1, then by Theorem 11.31 we may embed T s amongst the 2|T S | + k unoccupied 
outneighbours in N' and some arbitrary \T S \ — k outneighbours of v in N\ N'. Then at least 
k vertices of N' will be occupied by this embedding of T s . Finally, if at least r outneighbours 
of v in N' have been occupied by this embedding, then we may embed T s within the at least 
3|T S | unoccupied outneighbours of v in N. 

If instead t s is an inneighbour of t, then we may extend the embedding similarly, using 
(ii) and (iv) rather than (i) and (hi). So we may extend the embedding of T c in G[S] to 
an embedding of T in G by proceeding through each Ti in this manner. Also conditions (i) 
and (hi) will only be required if at least one edge of T is directed from T c to T — T c , and 
conditions (ii) and (iv) will only be required if at least one edge of T is directed from T — T c 
to T c . Finally, note that after each T s is embedded, either every vertex of T\ U • • • U T s will 
have been embedded in N', or at least r vertices of T\ U • • • U T s will have been embedded 
in N'. Since \T% U T2 U • • • U T q \ = \T — T c \ > r, we can be sure that at least r vertices of N' 
will be occupied by the embedding of T, as desired. □ 

Lemma 2.7. Suppose that 1/n < 7 < a < 1. Let T be a directed tree on n vertices, 
and let forests F~ and F + be induced subgraphs ofT such that V(F~) and V(F + ) partition 
V(T) and every edge between F~ and F + is directed from F~ to F + . Let and be 
the largest and second largest components of F + respectively. Also, let Y and Z be disjoint 
sets such that 

\Y\ > \F + \ + |T 2 + | + an and \Z\ > 2\F~\ + an. 

Let G be a tournament on vertex set Y U Z such that every vertex of Y has at most jn 
outneighbours in Z, and every vertex of Z has at most jn inneighbours in Y . Then any 
embedding of in G[Y] can be extended to an embedding of T in G. 

Proof. Let Ti,...,T r be the components of F~ and F + , ordered so that T\ = and 
so that for each 2 < i < r there is exactly one edge of T between Tj and T\ U • • • U Ti-\- 
Then we have an embedding of T\ in G[Y]. We shall proceed through the trees Ti in turn, 
embedding each T, in G[Y] if Tj is a component of F + , or in G\Z\ if Ti is a component of 
F~ . Each Ti will be embedded so that the embeddings of T%, . . . ,Ti form an embedding of 
the subtree of T induced by the vertices of Ti, . . . ,Tj. Suppose that we have successfully 
embedded T±, . . . , Ti—\ in this manner, and we wish to extend this embedding to include Tj. 
Note that there is precisely one edge e between Tj and T\ U • • • U Tj_i. Let t be the end vertex 
of e in Ti U • • • U Tj_i, and let v be the vertex to which t was embedded. 

If T is a component of F + , then t £ F~, so v G Z. In this case we will embed Tj 
within the unoccupied outneighbours of v in Y. Since v £ Z, \N + (v) n Y\ > \Y\ — 772 > 
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+ |T 2 + | +an/2. At most \F + \ - |Tj| of these vertices are occupied by the embeddings of 
T\, . . . , Tj_i. Since i > 2, T, is not the largest component of F + , and so has order |T| < |T 2 + |. 
So at least 2|Tj| + an/ 2 outneighbours of v in Y remain unoccupied. So if |T| > an/2 then 
by Theorem ll.2f i) we may embed Tj in these unoccupied vertices of N + (v)C\Y . On the other 
hand, if |T| < an/2 then by Theorem 11.31 we may embed T in these unoccupied vertices of 
N + (v)nY. 

Now suppose instead that T is a component of F~ . Then t £ F + , so v £ Y. Here we 
will embed T within the unoccupied inneighbours of v in Z. Since v EY, \N~(v) Pi Z\ > 
\Z\ — 771 > 2|T~| + an/2, and at most |T~| — |T| of these vertices are occupied by the 
embeddings of T\, . . . ,T_i. So at least 2|T| + an/2 such vertices remain unoccupied. So 
as before, if |T| > an/2 then by Theorem 11.2( 1) we may embed T in these unoccupied 
vertices of N~(v) n Z, whereas if |T| < an/2 then by Theorem 11.31 we may embed T in 
these unoccupied vertices of N~(v) fl Z. By proceeding through all of the trees T in this 
manner we will obtain an embedding of T in G. □ 

Observe that if in the statement of Lemma 12.71 we let T£~ and T 2 ~ be the largest and 
second-largest components of F~ respectively, and replaced the conditions on the sizes of Z 
and Y by the conditions that |y| > 2|T + | + an and \Z\ > \F~\ + |T 2 ~~| + an, then we could 
conclude that any embedding of T£~ in G[Z] can be extended to an embedding of T in G. 
To see this, either note that the proof will still be valid with appropriate changes (switching 
inneighbours and outneighbours and so forth) or observe that this is the effect of reversing 
the direction of every edge of T and every edge of G, in which case the embedding problem 
is the same. Sometimes when referring to Lemma 12.71 we will implicitly mean this 'dual' of 
Lemma 12.71 instead. 

3. Embedding trees whose core tree is a single vertex 

In this section we shall verify that Sumner's universal tournament conjecture holds for 
large directed trees T whose core tree Ta contains only one vertex, that is, trees which are 
'star-shaped'. Such trees can be embedded by selecting an appropriate vertex to which to 
embed the single vertex of T\, and then embedding the components of T — Ta one by one. 

Lemma 3.1. Suppose that 1/n <C 1/A <C 1. Let T be a directed tree on n vertices with 
|Ta| = 1, and let G be a tournament on 2n — 2 vertices. Then G contains a copy ofT. 

Proof. Introduce constants a and 7 with 1/A C a < 7 < 1. Let t be the single vertex 
of Ta, let y be the outweight of T\, and let z be the inweight of T\. Also, let T\ be the 
subtree of T formed by t and all of its outcomponents, and let T2 be the subtree of T formed 
by t and all of its incomponents. Then y + z = n — 1, |Ti| = y + 1 and IT2I = z + 1. Now, 
suppose that G contains a vertex v such that 

(i) either d + {v ) > y + 2n/A or y = 0, and 

(ii) either d~{v) > z + 2n/A or z = 0. 

Then embed t to v. By Proposition 12.11 each component of T — t contains at most n/A 
vertices. So by Lemma [2.61 we may extend the embedding of t in {v} to an embedding of T\ 
in {v} U N + (v) (since if y = then T\ consists of the single vertex t). Also by Lemma 12.61 
we may extend the embedding of t in {v} to an embedding of T2 in {v} U N~(v) (since if 
z = then t is the only vertex of T2). These two embeddings only overlap in the vertex v, 
and so combining these two embeddings gives an embedding of T in G. 
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So we may assume that every vertex v S G has either d + (v) < y + 2n/A or d (v) < 
z + 2n/A. Let Y := {v £ G : d + (v) < y + 2n/A} and let Z := {v £ G : d~ (v) < z + 2n/A}. 
Then every vertex of G lies in precisely one of Y and Z, so \Y\ + \Z\ = 2n — 2. Thus we 
must have either \Y\ > 2y or |Z| > 2z. Furthermore, if y = and |Y| > 1 then each v £ Y 
has c? + (t>) < 2n/A and therefore d~(v) > z + 2n/A, and so satisfies (ii). We may therefore 
assume that if y = then |Y| = and similarly that if z = then |Z| = 0. So without loss 
of generality we may assume that \Y\ > 2y and y > (otherwise reverse the direction of 
every edge of T and every edge of G; then we would have |Y| > 2y and y > at this stage, 
and the embedding problem is the same). Observe that by definition of Y we must also have 
\Y\ < 2y + 4n/A + 1. 

Now suppose that y > an. Since y G N and |Y| > 2y, Y must contain a vertex v which 
satisfies \N + (v) (~)Y\ >y. Choose a subset N' C JV + (i;) n Y of size y. For any vertex u € Y, 

d+ ( u ) = |jV+(u) n Y| < d%(u) <y + 2n/A < (1 + a)(|Y| - l)/2. 

So by Proposition 12.41 G[Y] contains a 7-almost-regular tournament on at least 2(1 — ^)y 
vertices. So at most |Y| —2(1— 7)?/ < 37?/ vertices of Y have fewer than (1— 2~f)y inneighbours 
in Y or fewer than (1 — 2<y)y outneighbours in Y. Since \N'\ = y, at most 671/ + 1 vertices 
of N' have more than (1 — 37)?/ inneighbours in N', and at most 672/ + 1 vertices of iV' 
have more than (1 — 37)2/ inneighbours in N'. So at least (1 — 167)1/ vertices of N' have 
at least jy inneighbours in Y \ N' and at least 72/ outneighbours in Y \ N'. Certainly 
therefore at least 3n/A vertices of N' have at least 6n/A inneighbours in Y \ ({f} U N') 
and at least 6n/A outneighbours in Y \ ({v} U A r '). So by Lemma 12.51 we may embed T\ 
in Y, with t embedded to v, and at most 4n/A vertices embedded outside N' U {v}. Let 
V be the set of vertices of G not occupied by this embedding of T\. Since v has at least 
\G\ — 1 — (y + 2n/A) > z + 6n/A inneighbours in G, all outside N'L){v}, v must have at least 
z + 2n/A unoccupied inneighbours in V. So by Lemma 12.61 we may extend the embedding 
of t in {v} to an embedding of T2 in {v} U V. These two embeddings only overlap in the 
vertex v, and so combine to give an embedding of T in G. 

So we may assume that 1 < y < an. Then every vertex v G Y has 

(2) d~(v) > \G\ -1-y -2n/A>n + 2n/A. 

Let T3 be the subtree of T formed by every vertex t' 6 T for which T contains a directed 
path from from t to i'. Then i £ T3, and (taking t as the root vertex) T3 is an outbranching. 
Also T3 C Ti, so I T3 1 < y + 1, and so by Theorem 11.41 we ma y embed T3 in G[Y]. Since 
Ta Q T3, by Proposition I2.1( iv) each component of T — T3 contains at most n/A vertices. 
So as every edge of T between T — T3 and T3 is directed from T — T3 to T3 , and also since 
by ([2]) every vertex of Y has at least \T — \ + 2n/A inneighbours which were not occupied 
by the embedding of T3, we may extend the embedding of T3 in G[Y] to an embedding of T 
in G by Lemma 12.61 □ 

4. The regularity lemma and its applications to embedding trees 

In this section we shall present a degree form of the regularity lemma for directed graphs, 
and show how this may be used to embed trees. In particular, the regularity lemma is 
useful for embedding directed trees T for which Ta is substantially smaller than the size of a 
cluster obtained by applying the regularity lemma to a tournament G; our approach here is 
essentially to select an appropriate cluster in G in which to embed so that we may then 
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embed the components of T — Ta in the remaining clusters of G. By using this method we 
shall prove Lemma 14.61 which states that Theorem 11.11 holds in the case where G is a large 
and almost-regular tournament, and T is a directed tree such that Ta is small. 

Let U and V be disjoint sets, and let G be a directed graph on vertex set U U V. Recall 
that G[U — > V] denotes the bipartite subgraph of G formed by edges directed from U to V. 
The density from U to V ', denoted d(G[U — > V]), is then defined by 



We say that G[U — >■ V] is e-regular if for any U' C U and V' C V with \U'\ > e\U\ and 
> e\V\ we have -> V]) = d(G[i7 -> V]) ± e. 



The next lemma is the degree form of the regularity lemma which we shall use. A regularity 
lemma for digraphs was proven by Alon and Shapira [TJ. The degree form follows from this 
in the same way as in the undirected case (see |llj for a sketch of the latter) . 

Lemma 4.1 (Regularity Lemma for directed graphs). Suppose that 1/n <C 1/M <C 1/M' <C 
e. Let G be a directed graph on n vertices. Then there exists a partition of V(G) into 
Vq, . . . , Vk and a spanning subgraph G' of G such that 



(5) d,Q,(x) > d^(x) — en for all vertices x E V{G), 

(6) for all i G [k] the directed graph G'[Vi] is empty, 

(7) for all i,j E [k] with i ^ j the directed graph G'[Vi —> Vj] is e-regular. 

We say that an oriented graph G on clusters Vi, ■ ■ ■ , V*. of equal size is an e-regular cluster 
tournament if for any i,j E [k] with i 7^ j the subdigraph G[V{ —> Vj] is e-regular and 
for any i E [k] the subdigraph G[Vi] is a tournament. If G is a cluster tournament on 
clusters Vi,...,Vk then we shall denote the density of G[Vi — > Vj] by dij for any i,j E [k] 
(the tournament G will be clear from the context). The following corollary of the regularity 
lemma shows that any sufficiently large tournament G contains an almost-spanning e-regular 
cluster tournament G* such that vertices have similar in- and outdegrees in both G and G* . 

Corollary 4.2. Suppose that 1/n <C 1/M <C 1/M' <C e. Let G be a tournament on n 
vertices. Then there exist disjoint subsets V\,... , Vk Q V(G) of equal size and a subgraph 
G* C G on vertex set V\ U • • • U Vk such that: 

(i) M' <k< M, 

(ii) G* is an e-regular cluster tournament, 



(iv) d,Q*(x) > ^(x) — 2en for all vertices x E V(G), and 

(v) d,Q,(x) > dg(x) — 2en for all vertices x E V(G). 

Proof. Apply Lemma l4~T1 to obtain a partition Vo, • • • , Vk of V(G) and a subgraph G'CG 
which satisfy the conditions of Lemma 14. 1[ In particular (i) and (hi) are satisfied. Now 
form G* from G'\V\ U • • • U Vk] by adding every edge of G for which both endvertices lie in 
the same cluster Vi. So G* C G, and by (7) of Lemma 14.11 and the fact that G*[V^] is a 



d(G[U -+ V]) : 



<G[U -> V]) 

\u\\v\ 




(hi) Uie[k)Vi>a-e)n, 
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tournament for each i G [k] we have (ii). Finally note that using (4) of Lemma 14.11 we have 

d^,,(x) > d%,(x) -\V \>d+(x)- 2en. 
Similarly dQ„{x) > d^(x) — 2en using (5) of Lemma 14.11 □ 

It follows immediately from the definition of regularity that if U and V are sets of size 
m, and G[U — > V] is e-regular with density d, then all but at most 2em vertices of U have 
(d ± e)m outneighbours in V. The next lemma is a generalisation of this fact, considering 
the number of outneighbours of vertices in one cluster within a cluster tournament. 

Lemma 4.3. Suppose that 1/m -C 1/k -C £ <C s! -C 1- Let G be an e-regular cluster 
tournament on clusters V\, . . . , V&, each of size m. Let V- C Vj for each j G [k] be fixed. 
Then for any i, all but at most e'm vertices ofV{ have Y2je[k]\{i} dij\Vj\ ±e'km outneighbours 
in Uje[k]\{i} V j and Eje[fe]\{i} d ji\ V j\ ± £ ' km inneighbours in \J je[k] \ {i} 

Proof. Fix some i G [k]. Then let L be the set of all j G [k] \ {i} such that \Vj\ > em 
and dij > ^fe. For each j G L, let Aj denote the set of vertices of Vi which have fewer than 
(1 — ^)dij\ V-\ outneighbours in V-. Then for each j G L, the subdigraph of G[Vi — > Vj] 
induced by Aj and Vj has density less than (1 — yfe)dij < dij — e. Since G[Vi — > Vj] is 
£-regular with density d^, and \V!\ > em, we must have \Aj\ < em. 

Now, fix a vertex v G V. Suppose that v appears in at most of the sets Aj with 

j G L. Then 

l^»n|J^I> £ (l-v^Wl 

jeL j£L:v£Aj 

> (i-v^Hii^/i- E Wi- E 

je[k]\{i} je[k]\(W{i}) jeL-.veAj 

— dij\Vj\ — \fekm — \fekm — \fe\L\m 
ie[fc]\{i} 

— E dij\Vj\ — 3\/ekm. 
ie[fc]\{i} 

Since at most yfem vertices v G Vi appear in more than -v/ej^l of the sets Aj with j G L, we 
may conclude that there are at most yfem vertices v G Vi with fewer than X^e[fc]\{i} dij\V-\ — 
3^/ekm outneighbours in Uje[fc]\{i} Vj. A similar argument shows that there are at most ^fem 
vertices v G Vi with more than Ysje{k]\{i} dij \Vj\ + 3^/ekm outneighbours in Uje[fc]\{i} Vj. 

Now, let L' be the set of all j G [k] such that \Vj\ > em and dj, > -^/e. Then the same 
argument applied to inneighbours rather than outneighbours shows that there are at most 
yfem vertices v G Vj with fewer than J2je[k]\{i} dji\Vj \ — 3^/ekm inneighbours in UjG[A;]\{j} Vj 
and at most \J~e~m vertices v G Vi with more than Ylj£[k]\{i} dji\Vj\ + 2>yfekm inneighbours 
in Uje[fc]\{i} Vj' Since e <C e' , this completes the proof. □ 

The next two lemmas will be used in the proof of Lemma 14.61 we state them separately 
as we shall also refer to them in Section [6l Both of these consider an e-regular cluster 
tournament G on k clusters with the property that for some cluster V, the density of edges 
leaving Vi and the density of edges entering Vi are each roughly 1/2. Lemma 14.41 considers 
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the case where for many clusters Vj the density of edges between Vi and Vj is large in both 
directions, showing that in this case G contains a copy of a directed tree T of the type 
considered. Lemma 14.51 considers the alternative, namely that for almost all clusters Vj the 
density of edges between Vi and Vj is small in one direction, showing that in this case G 
contains a copy of T provided that Ta has large inweight and large outweight. 

Lemma 4.4. Suppose that l/n < l/A',/3 <1/A;<e<7<a«; 1/A < 1. Let T be a 

directed tree on n vertices with |7a'| < /3n and |Ta| > 2, and let G be an e-regular cluster 
tournament on clusters Vi, . . . ,Vk, each of size m > 2(1 — j)n/k. Suppose also that for 
some i £ [k] we have 



(1-37)*; , , (l-37)fe 
dij > and d ji ^ 9 • 



je[k]\{i} je[k]\{i} 

and also that there are at least ak values of j £ [k] \ {i} such that dij > a and dji > a. 
Then G contains a copy of T. 

Proof. Fix such a value of i, and introduce a new constant e' with e <C e' <C 7. Since 
A < A', we must have T\ C Ta'. Also, since |Ta| > 2, we may choose an edge t~ — > t + 
of Ta , which therefore is also an edge of T&> . Let T + and T~ be the two components formed 
when this edge is deleted from T, labelled so that t + G T + and t~ € T~. Similarly, let T^, 
and T^ ; be the two components formed by the deletion of the edge t~ — > t + from Ta', 
labelled with t + £ Tt, and t~ € TT,. Then T + and T - partition the vertices of T, and there 
is precisely one edge of T between T + and T~, which is directed towards T + . Furthermore, 
since t~ —¥ t + was an edge of Ta, by Proposition 12. lf ii) we have |T + |, \T~\ > n/A. 

Let JC [fc] \ {2} satisfy | J\ > ak and also that for any j £ J we have dij > a and (fjj > a. 
Then Ylj&jdij > a 2 k and Ylj^jdji > « 2 &- By Lemma [4731 (applied with V- = % for each 
j ^ J) at most e'm vertices of Vi have fewer than 

. . <r—\ , o, tr a 2 km 

(3) y dijm — e km > a km — e km > 

jeJ 

outneighbours in UjeJ^J or f ewer than SjeJ 4/« m — e'^ni > a 2 km/2 inneighbours in 
Ujej ^jf- Also by Lemma 14.31 at most e'm vertices of Vi have fewer than 

. , v— v , , (1 — 37 — 2e')km 

(4) 2^ d^m - e'km > i '— '- > (1 - 57)71 

outneighbours in UjepfciUi} Vj or fewer than Ylje[k]\{i} 4?i m — e'fcm > (1 — 57)72 inneighbours 
in Ujg[fc]\{i} yj - Finally, at most m/2 + 1 vertices of Vi have fewer than m/4 inneighbours 
in V^. So we may choose a set S" + of m/10 vertices of Vi which do not fall into any of these 
categories. Since \Tf, \ < |Ta'| < j3n < m/30, by Theorem 11.31 we may embed T£, in S* + . 
Let S~^, be the set of vertices of S + occupied by this embedding of T^,, and let v + be the 
vertex to which t + was embedded. Recall that \T~\ > n/A, so 

\T + \ =n - \T~\ < (1 - 1/A)n. 

Furthermore, every component of T + — T£, is a component of T — Ta' and thus has order 
at most n/A' by Proposition 12.11 So by ([3]) and (jl]), and since 7 <C 1/A, we may apply 
Lemma 12.6( b) to extend the embedding of T^, in S^, to an embedding of T + in S^, U 
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> (i- 


- 57)7?. 




>a- 


57)71 • 


(5) 


>a- 


57)n - 



[Jj£[k]\{i}Vj so * na ^ a * l eas t a 2 n/3 vertices of {Jj^jVj are occupied by this embedding of 
T+. 

Now, at least m/4 — m/10 = 3m/20 vertices of V{ \ S~^, are inneighbours of v + . For 
each j G [fc] \ {i}, let Oj denote the number of vertices of Vj which are occupied by our 
embedding of T + , and let V- C Vj consist of those vertices of Vj which are not occupied 
by this embedding. So \V-\ = m — Oj for each j. Note that since dij + dji < 1 we have 
dij < 1 — a for each j G J. Then by Lemma |4.3| at most e'm vertices of Vi have fewer than 

dij(ra — Oj) — e'km > dijvn — e'km — f%Oj — 

j'e[fc]\{i} ie[*]\{*} ieJ ie[fc]\({i}uj) 

i^J ie[*]\({i}uj) 

^ °i + « ^ °i 

ie[fc]\W jgJ 

X + a 3 n/3 > n - ^ a,- + 2n/A' 

jeW\« ie[fc]\{i} 

outneighbours in Uje[fc]\{j} or fewer than 

dji(m — Oj) — e' km > n — Oj + 2n/A', 

je[k]\{i} je[k]\{i} 

inneighbours in UjG[fc]\{i} ^j'- So we ma y choose a set £~ of m/10 vertices of V{ \ S~^,, 
none of which fall into these two categories, and all of which are inneighbours of v + . Since 
I^A'I — l-^A'l ^ fi n < m/30, by Theorem 11,31 we may embed T^, in S~. Let <S^, be the set 
of vertices of S~ occupied by this embedding of T^, . Then since 

\T~\ = n — \T + \ < n — Oj, 

the right hand side of ([5]) is at least \T~\ + 2n/A'. Also every component of T~ — TT, 
is a component of T — Ta' (and so has order at most n/A' by Proposition I2,ll fiv)). So 
by Lemma 12,61 we may extend the embedding of T7, in S^, to an embedding of T~ in 
U Uje[fc]\{t} ^j'- Then the embeddings of T + and T~ do not overlap, and so together 
these embeddings form an embedding of T in G. □ 



Given an e-regular cluster tournament G on clusters V\ , . . . , , we define the reduced 
digraph of G with parameter d, denoted Rc(d), to be the directed graph on vertex set [k] 
in which i — > j if and only if dij > d. Observe that since dij + dji < 1 for any i and j, if 
d > 1/2 then Rc{d) is an oriented graph. 

Lemma 4.5. Suppose that 1/n <C l/A',/3 <C 1/A; <£<7<a<l £e£ T be a directed 
tree on n vertices with |2a'| < «?id Zei y and z 6e i/te outweight and inweight of T&> 
respectively. Let G be an e-regular cluster tournament on clusters V\, . . . ,Vk, each of size 
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m > 2(1 — j)n/k. Suppose that for some i E [k] we have 

, (1 - 3j)k (1 - 3j)k 
2^ d ij ^ 5 ^ ^ - 2 ' 

J'6[*]\W J'6[*]\W 

and also that there are at most ak values of j E [k] \ {i} such that dij > a and dji > a. 
Then: 

(i) There are at most 2ak values of j E [k] \ {i} such that dij < 1 — 2a and dji < 1 — 2a. 

(ii) Let R := R G (1 - 2a). Then \N+(i)\,\N^(i)\ > (1 - Wa)k/2. 

(iii) Ify,z > lAan, then G contains a copy ofT. 

Proof. Fix such an i, and introduce a new constant e' with e -C e' -C 7- For (i), note that 
since d, L j + djj < 1 for any j E [k] \ {i}, and 

^2 + d 3i) ^ C 1 - 

je[fe]\« 

there are at most 3y/^yk < afc values of j E [fc] \ {i} for which + dji < 1 — ypy. So there 
are at most 2ak values of j E [k] \ {i} for which dij < 1 — a — y/j and dji < 1 — a — y/^y, so 
(i) holds. 

For (ii), observe that by (i) we have 

(1 — 3j)k \ ^ v — v v ^ >. ^ 
- < 2^ di i - Z> di i + Z> di i + Z> di J 

ie[fe]\{i} ie[*]\{i} ieW\W ie[fc]\{*} 

dij>l—2a dij ,dji<l—2a dij<2a 

< \N+(i)\ +2ak + 2ak, 

so \N^(i)\ > (1 - 10a)fc/2. A similar calculation shows that \N~(i)\ > (1 - 10a)fc/2. 

For (iii), let iV + and N~ denote N^(i) and N^(i) respectively, and let V + := [jj eN + Vj 
and V~ := [Jj eN - Vj, so y + and V~ are disjoint. By Lemma 031 Vi contains at most e'm 
vertices with fewer than 

V c%m - e'fcm > |JV^(i)|(l - 2a)m - e'km > (1 - 10a) (1 - 2a)km/2 - e'km 

j€N+ 

> (1 - 12a - 2e')km/2 > (1 - 13a)n 

outneighbours in 1/ + and at most e'm vertices with fewer than X^jGAf- <%i m — e'&m > 
(1 — 13a)n inneighbours in V~ . Choose a set S of m/2 vertices of Vi, not including any of 
these at most 2e'm vertices. Since |T/y| < fin < m/6, by Theorem 11.31 we may embed Ta> 
in S. Let Sa 1 be the set of vertices of S occupied by this embedding of Also let T% 
be the tree formed by T/y and all of its outcomponents, and let T2 be the tree formed by 
T/y and all of its incomponents. Note that all of these out- and incomponents have order 
at most n/A' <C an by Proposition I2.1f iv). In addition |Xi| = n — z < (1 — 14a)n and 
\T^\ = n — y < (1 — 14a)n. So by Lemma 12.61 we may extend the embedding of Ta> in Sa' 
to an embedding of T\ in Sa 1 U V + . Similarly by Lemma [2.61 we may extend the embedding 
of Ta' in Sa' to an embedding of T2 in Sa 1 U V~ . Then these embeddings do not overlap 
outside Ta>, so we may combine them to form an embedding of T in G. □ 
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To finish this section we shall show how Lemma 14.11 can be used to show that Sumner's 
universal tournament conjecture holds for any large and almost-regular tournament with a 
small core tree. Actually we shall prove a slightly stronger result in this case, considering a 
tournament on fewer than 2n — 2 vertices. Later on we shall make use of the fact that we 
have a little room to spare in the order of the tournament. Much of the work for this lemma 
is done by the two previous lemmas. 

Lemma 4.6. Suppose that 1/n <C l/A',/3 <C 7 "C 1/A <C 1. Let T be a directed tree on n 
vertices such that |2a'| < /3n and |Ta| > 2. Let G be a 7- almost-regular tournament on at 
least (2 — 7)71 vertices. Then G contains a copy ofT. 

Proof. Introduce new constants e,e' ,a, M, and M' with 

1/n « l/A',/3 < 1/M < 1/M' < e < e' < 7 < a < 1/A < 1. 

If I C I > (2 + 7)71, then G contains a copy of T by Theorem II. 2 f i). So we may assume that 
\G\ = (2± 7 )n. Observe that d+(v),d'(v) > (1 - 7 )(|G| - l)/2 > (1 - 2 7 )n for all v £ G. 
Since A < A', we must have Ta Q Also, since |Ta| > 2, we may choose an edge 

—> t + of Ta, which must also lie in Ta'. Let T + and T~ be the two components formed 
when this edge is deleted from T, labelled so that t + G T + and t~ G T~. Similarly, let 
and T^, be the two components formed by the deletion of the edge t~ — > t + from Ta' , 
labelled with i + G T^, and t~ G TT,. Then T + and T~ partition the vertices of T, and there 
is precisely one edge of T between T + and T~, which is directed towards T + . Furthermore, 

|r+|,|r-| > n/A. 

Let disjoint subsets V\, . . . , and a subgraph G* C G satisfy the conditions of Corol- 
lary H2J So M' < k < M, and G* is an e-regular cluster tournament on clusters Vi, . . . , Vk 
of equal size m, where 

2(1 - 7 )n (2 - 7)71 - 3en (2 + 7)71 

W fc " k fe 

Also, for each w G G* we have d~Q*(v) > <Iq(v) — 2e\G\ > d^(v) — ben and d^(v) > 
dQ(v) — ben. So for each i S [k] we have 

m E E ^s^^^E 5 ^ 

ie[*]\{t} je[k]\{i} 

■sp -ben-m > (1 - 27)71 - 5en - m © (1 - 37)^: 

— m 2 _ m _ 2 

and similarly Eje[fc]\{i} d ji > i 1 ~ 3 7 )A;/2. 

So if there exists some 7 G [fc] for which there are at least ak values of j G [fc] \ {7} such 
that c?jj > a and dji > a, then by Lemma 14.41 we may embed T in G*, and therefore in G. 
So we may assume that for each i G [k] fewer than ak values of j G [k] \ {i} satisfy dij > a 
and dji > a. Then by Lemma 14.51 we may assume that R := -Rg(1 — 2a) has 

(8) 5°(R) > (1 - 10a)fc/2. 

Let 7/ be the number of vertices in outcomponents of Ta', and let z be the number of vertices 
in incomponents of Ta', so y + z + |Ta'| = ra. So if y, z > 14an then G* (and therefore G) 
contains a copy of T by Lemma 14.51 We may therefore assume without loss of generality 
that z < Wan. 
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Now, since \R\ = k we may choose a vertex i G R with d^(i) < k/2. Then we may 
choose a vertex j G N^(i) with at most dj t (i)/2 outneighbours in N^(i). So i — > j and 
\N£(i) n A^(i)| < k/4. For this choice of i and j, let 

A:=N+(i)nN+(j), 
B:=N+(i)\N+(j), 
C:=N+(j)\N+(i). 

Then and C are disjoint, and \B\,\C\ > k/2 - 5ak - \A\ > k/A - 5ak by ©. Now, 
choose a set S + of m/2 vertices of Vj such that each vertex v G S + has 

(i) at least m/2 inneighbours in Vi, 

(ii) at least X^eA 6 ^ 771 ~~ e'km > (1 — 2a)m|^4| — e'km outneighbours in (J^eA an d 
(hi) at least Yli^c djim — e'km > (1 — 2a)m\C\ — e'km outneighbours in Ufec 

We can be sure that such a choice is possible, as by Lemma 14,31 there are at most 2e'm 
vertices of Vj which fail either of (ii) and (iii), and since Cr*[Vi — > Vj] is e-regular with density 
dij > 1 — 2a there are at most em vertices of Vj which fail (i). Then since \Tt/ | < fin < m/6, 
by Theorem 11.31 we can embed in S + . Let v + be the vertex to which t + is embedded. 
Then v + has at least m/2 inneighbours in Vi. Choose a set S~ of m/3 of these inneighbours 
so that every vertex v G 5~ has at least 

(9) 2^ d«m - e'fcm > (1 - 2o)m|iV+(i)| - e'km > (1 - 13a)n 

£eAufi=7V+(i) 

outneighbours in U^gAuB Again we can be sure that such a choice is possible, since by 
Lemma 14.31 at most e'm vertices of Vi fail this condition. Then since \T7,\ < (3n < m/9, by 
Theorem [L3] we can embed T^, in S~ . Let S^, and S*^, be the sets of vertices of G occupied 
by T^, and T~^, respectively. 

Let T3 be the tree formed by and all of its incomponents. Let T4 be the tree formed 
by T£, and all of its outcomponents, and let T5 be the tree formed by T7, and all of its 
outcomponents in T~ (i.e. all of its outcomponents except T + ). Note that T3UT4UT5 = T. 
Then \Ts\ = |T/\'| + z < 15cm, IT4I < |T + | < n — | T | < (1 — 1/A)n, and similarly 
\T§ I < (1 — 1/A)n. Every vertex of G has at least (1 — 27)71 inneighbours in G, so by 
Lemma 12.6( c) we may extend the embedding of in fifj, U S~^, to an embedding of T3 in 
G. For each £ E [k] \ {i}, let 7/ C 7| consist of the vertices of Vg which are not occupied by 
this embedding. 

By (ii) and (iii), every vertex of S^, then has at least (1 — 2q)(| j 4| + |C|)m — 2e'km — 
\Ts\ > (1 — 28o)n outneighbours in U^gAuc V'l ( nere we a l so use the fact that \A\ + \C\ = 
> (1 - 10a)fc/2 by ©). Since also 1/A' < a < 1/A and every component 
of T4 — T^, has order at most n/A', by Lemma 12.61 we may extend the embedding of 
Tj, in 5^, to an embedding of T4 in S 1 ^, U U^eAuc* ^7- Furthermore, since every vertex 
of S^, has at least (1 — 2a)|C|m — e'km — \T%\ > n/A outneighbours in U^eC^' an< ^ 
IT4 — T^,| = |T + — Ts\ > n/2A, by Lemma 12.6( b) we can ensure that this embedding of T4 
occupies at least n/2A vertices of U^ec Vl- So crucially at most \TA — n/2A vertices of T4 
are embedded in U^gAub For eacn ^ G ^4 U -B, let V/' C consist of those vertices which 
are not occupied by the embedding of T3 and T4. 
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Finally, by ([9]), every vertex of S^, has at least 

(1 - 13a)n - (|T 4 | - n/2A) - |T 3 | > n - \T A \ + n/3A 

outneighbours in U^eAuB Yt'' Since \T^ — T^,\ < n — \T&\, by Lemma l2T6T c) we can extend the 
embedding of T^, in S^, to an embedding of T 5 in S^, U U^eAuB ■ Then the embeddings 
of T3, T4 and T5 do not overlap outside S\, U S^,, and so together form an embedding of T 
in G. ~ □ 



5. Embedding trees in robust outexpander tournaments 

Let G be a tournament on n vertices, and let fi < v be positive constants. Then the 
robust outneighbourhood RN+(S) of a set S C V(G) is the set of vertices of G with at least 
/in inneighbours in S. We say that G is a robust (^,1/)- outexpander if for any S 1 C V(G) 
with vn < \S\ < (1 - i/)n we have |iUV+(5)| > |5| + /un. 

If a tournament G is not a robust outexpander, then the following lemma shows that G 
contains two subtournaments which partition the vertices of G and which have almost all 
edges between them directed the same way. 

Lemma 5.1 ([10]. Lemma 2.8). Suppose that 1/n <C u <C v, that G is a tournament on 
n vertices and that G is not a robust (n,v)- outexpander. Then we can partition V{G) into 
sets S and S' such that vn < \S\, \S'\ < (1 — v)n and e(G[S — > S']) < 4fj,n 2 . 

By iterating this split, we obtain a decomposition of G into sets Si which either induce 
robust expanders or are small, and where for all i < j, almost all edges are directed from Si 
to Sj. (So if all the Si are small, then G is close to being a transitive tournament.) We will 
use this decomposition in Section [7] to prove Theorem 11.11 

Lemma 5.2. Suppose that 1/n <C jii < f < i] < 7 < 1. Let G be a tournament on n 
vertices. Then we may choose disjoint subsets Si, ■ ■ ■ , S r of V(G) such that: 

(i) m EH £|>(l-7)n, 

(ii) for each i G [r], any vertex v G Si has at most 772 inneighbours in [jj^Sj and at 
most jn outneighbours in [Jj^Sj, and 

(hi) for each i G [r], either G[Si] is a robust (/U,z/)- outexpander with 5°(G[Si]) > nn or 
\Si\ < 771. 

Proof. We shall use a modified version of an algorithm from [10] , which keeps track of 
an ordered family S T of disjoint subsets of V(G), and a set B T of bad edges of G, at each 
time r. The analysis of this algorithm is also similar to the analysis in [10J. Initially, let 
S 1 := (V(G)), and let B 1 := 0. Then at time r > 1, we have S T = . . .,£%), and the 
algorithm proceeds as follows. 

(1) Let SJ be the largest member of S T which is not a robust (/j, i/)-outexpander with 
S°(G[SJ]) > rjn. If there is no such member of S T , or if |*ST| < 771, then terminate. 
If there is more than one largest such member, then choose one of these arbitrarily. 

(2) If some v G SJ has g^^t] (v) < rjn, then let 

S T+1 := (SI, SU,SJ \ W, W, SJ +1 , . . . , SJ), 
let B T+l := B T U E({v} -> SJ \ {v}), and proceed to step (5). 
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(3) Similarly, if some v G SJ has d G ^ s ^ (v) < nn, then let 

:= (si, su,{v}, sj \ {v}, sj +1 , . . . , sj), 

let B T+l := B T U E(SJ \ {v} -> {«}), and proceed to step (5). 

(4) If G[(ST] is not a robust (/i, i/)-outexpander then apply Lemma 15.11 to partition the 
vertices of SJ into sets 5" and S" such that v\SJ\ < \S'\,\S"\ < (1 — u)\SJ\ and at 
most 4u\SJ\ 2 edges of G[SJ] are directed from S" to S'. Then let 

O .— I^Oj , . . • , >Jf_l! »J i »J i • • • ' ) 

and let B T+1 := B T U E(S" -> S r ). 

(5) Finally, for each i G [r + 1], delete from SJ +1 any vertex u which lies in more than 
^/fjn edges of B T+l . 

At any time r, if the algorithm does not terminate at step (1) then SJ will be split in 
precisely one of steps (2), (3) and (4). So at each time r, either the algorithm terminates 
or |5 r | increases from r to r + 1 (in forming 5 T+1 ) by reducing the size of the largest piece. 
Therefore the algorithm must terminate at some time T en d < n. Take r := T en d, and S{ := S|' 
for each %. Then since the algorithm terminated at step (1) of time r, (iii) must hold. 

To see (i), observe that the split in step (4) will occur for at most \j^v times r < T en( i- 
This is because any set obtained by a split in step (4) must have size at least ^vn (since 
\SJ\ > 771, and the sets S', S" obtained have \S'\, \S"\ > u\SJ\). Also, at each time r < r en( i, 
the number of edges added to form B T+1 from B T is at most rjn if the algorithm carried out 
the split in step (2) or (3), and at most 4/xn 2 if the algorithm carried out the split in step 
(4). Since T en( i < n, and the split in step (4) is carried out in at most 1/7^ steps, we must 
have 

\B Tend \ < r/n 2 + 4/in 2 /^7 < 2nn 2 . 

Since B 1 C • • • C B Tend , any vertex of G which was ever deleted in step (5) must lie in at 
least y/rjn edges of B Tend , and so at most 4y / r/n < ^yn vertices of G can have been deleted in 
step (5) over the entire course of the algorithm. But any vertex which was not deleted lies 
in some Si, and so (i) holds. 

Finally, for (ii) fix any i G [r] and any v G Si. Observe that all edges directed from v to 
Uj<i &j an d ah edges directed from |Jj>i Sj ^° v are contained in B r . This means that there 
are at most y/rjn such edges, as otherwise v would have been deleted in step (5) at some 
point. Since i and v were arbitrary, (ii) must hold. □ 

We now consider the case when G is a robust outexpander. Lemma 4.1 of |10j stated that 
if T is a directed tree on n vertices, and G is a robust outexpander tournament on at least 
(2 + a)n vertices with large minimum semidegree, then G contains a copy of T. However, 
in the proof of this lemma, the an error term was only needed in the case when Ta is small. 
Indeed, in this section we modify this proof to show that Sumner's universal tournament 
conjecture holds for such G in the case when Ta is large. This is the following lemma. 

Lemma 5.3. Suppose that 1/n < 1/A <ju<2/<?)<7< j 3<l. Let T be a directed 
tree on n vertices such that \T&\ > (3n, and let G be a robust (fi, v)- outexpander tournament 
on at least (2 — 7)71 vertices, with S°(G) > n\G\. Then G contains a copy ofT. 

Before we can present the proof of this lemma, we must give some definitions from |10| . 
Let V\, . . . , Vk be disjoint sets of equal size. A digraph G on vertex set V%, . . . , Vk is a e-regular 
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d-dense cycle of cluster tournaments if for each i, G\Vi\ is a tournament and G[Vi —> Vi+x] 
is e-regular with density at least d (where addition on the index of V^+i is taken modulo 
k). The following lemma from [TDJ (an immediate consequence of two results from |12| ) will 
help us to find such digraphs. 

Lemma 5.4 ([10], Lemma 2.7). Suppose that 1/n < 1/M <C l/M 1 <C e <C d <C /i < 

*C ^ *C 1. £e£ G be a tournament on n vertices which is a robust (n,v)-outexpander with 
S°(G) > rjn. Then G contains an e-regular d-dense cycle of cluster tournaments on clusters 
Vi,..., V k , where \ [f i=l V\ > (1 - e)n, and M' <k<M. 

Let T be a directed tree. Then the distance between vertices u,v E T, denoted d(u, v), is 
the length of the shortest path connecting u and v in the underlying graph T un d er - Similarly 
for a set X of vertices of T, the distance d(u, X) is the minimum of d(u, x) taken over all 
vertices x E X. If T is a rooted tree with root r, then the children of a vertex u E T are 
those neighbours v of u for which d(r, u) = d(r, v) + 1. 

Let T be a tree on n vertices, rooted at ti, and let H C V(T). Also let fc be a positive 
integer. For any vertex x E T, there is a unique path in T from x to t±] let P x denote the set 
of the first k vertices of this path, starting from x. Let H 1 := Uze-ff^' an( ^ then for each 
z > 1 let H l+1 be formed from H l by adding the vertices of P x for any x E H l with at least 
two children in H % . After at most n steps we must have H l = H l+1 , when we terminate 
the process. We refer to this final W as H with leading paths included, denoted Vk(H). So 
H C Vk{H) C V(T). Note that V k {H) depends on both the value of k and the root ti of T. 

We may now present the key lemma from [10] we shall use to prove Lemma [5 .31 This says 
that a directed tree of bounded degree can be embedded in a robust outexpander tournament 
of large minimum semidegree such that the vertices in a small set H of vertices of T are 
embedded within a chosen set U C V(G). 

Lemma 5.5 ([ID]. Lemma 4.6). Suppose that 1/n < 1/A, 1/fe <e<c?<a, A< 1/2, 
i/ia£ m := n/A;, t/iot A < a/4 and i/iai 5 := d\/8k. Let T be a directed tree on n vertices 
rooted at t\ and with A(T) < A. Let H C V(T) be such that \H\ < Sn/7k and \ {x E T: 1 < 
d(x,Vk(H)) < fc 3 }| < <5n. Let G 6e an e-regular d-dense cycle of cluster tournaments on 
clusters V\, . . . , Vk, each of size (1 + a)m, and let U C V\ U • • • U Vk have size \U\ > An. 
Then T can be embedded in G so that each vertex t E H is embedded to some u E U . 

We will also use the following lemma, again from [10] . This shows that we can extend Ta 
to an 'extended tree' T ex t, with desired properties. We will apply Lemma 15.51 to T ex t and 
embed H within a set U of vertices of high in- and outdegree. 

Proposition 5.6 ([ID]. Lemma 4.5). Suppose that 1/n, 1/A* < 1/A, < 1. Let T be 

a directed tree on n vertices. Choose any vertex t\ E Ta as the root of T . Then there exists 
a subtree T ext of T and a subset H C V(T ext ) which satisfy the following properties. 

(i) T A C T ext . 

(ii) A(T ext ) < A*. 

(iii) For any edge e between T — T ext and T ext , the endvertex of e in T ext lies in H. 

(iv) The number of vertices v E T ext which satisfy 1 < d(v,Vk{H)) < k 3 is at most con. 

(v) \H\ < n/A fcl/ ". 

The final lemma we shall need to prove Lemma 15.31 gives standard Chernoff-type bounds 
for the binomial and hypergeometric distributions. The binomial random variable X with 
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parameters (n,p) is defined to be the number of successes in n independent trials, each of 
which has probability p of success. So EX = np. The hyper geometric random variable Y 
with parameters (n, m, k) is defined as follows. Let N be a set of size n, and fix a set S C N 
of size IS" | = m. Now choose a set T C iV of size |T| = uniformly at random. Then 
y = |T n 5|. Note that EY = km/n. 

Proposition 5.7 ([9], Corollary 2.3 and Theorem 2.10). Suppose X has binomial or hyper- 

2 

geometric distribution and < a < 3/2. Then F(\X - EX\ > aEA) < 2e~"r EX . 

Proof of Lemma 15.31 We begin by introducing new constants A*, M, M' , e, d and a which 
satisfy 

l/n <C 1/A* < 1/M <C 1/M', 1/A «E«d«/i«K<r)«7«a«)3«l. 

Now, if |G| > (2 + 7)71, then by Theorem |1.2f i). G contains a copy of T. So we may 
assume that \G\ = (2 ± 7)72. Since G is a robust (u, i/)-outexpander with <5°(G) > rj\G\, 
Lemma 15.41 implies that G contains an e-regular d-dense cycle of cluster tournaments on 
clusters V\, ■ ■ ■ ,Vk each of equal size between (1 — e)\G\/k > (1 — e)(2 — j)m > 2(1 — 7)771 
and \G\/k < (2 + 7)™, where m := n/k and M' < k < M. So we may remove vertices 
from each Vi to obtain a 2e-regular (d/2)-dense cycle of cluster tournaments G' on clusters 
V{, ...,V£ each of size 2(1 - 7)771. So \G'\ = 2(1 - 7 )n. Let 

5 := dafi/imk. 

Choose any vertex t\ £ as the root of T. Then let T ex t and H satisfy the properties 
of Proposition 15.61 with u := 5f3. Let T\ denote the subtree of T formed by T ex t and all 
of its outcomponents, and let T2 denote the subtree of T formed by T ext and all of its 
incomponents. Since Xa Q T ex t (this is (i) of Proposition 15. 6p . all of these incomponents 
and outcomponents have order at most n/A by Proposition 12.11 Let x := \T ex t\,y := 
|?i — T ext \, z := |T2 — T ext \, so x + y + z = n. Since Ta ^ T ext , we have x > fin. Also, all 
but at most 2y + x — an/ 2 vertices of G have at least y + x/2 — an/ 4 outneighbours, and all 
but at most 2z + x — an/ 2 vertices of G have at least z + x/2 — an/ 4 inneighbours. So at 
least (2 — 7)77 — 2y — 2z — 2x + an > an/2 vertices of G satisfy both of these conditions. Let 
Uq be the set of these vertices, so \Uq\ > an/2, and each v G Uq has at least y + x/2 — an/ '4 
outneighbours and at least 2 + x/2 — an/4 inneighbours. 

From each cluster V( of G' choose a set of (l + a)x/k vertices uniformly at random, and 
let X := X\ U • • • U X^. Then |X| = (1 + a)x. For any single vertex u £ G' , the probability 
that u is included in X is (1 + a)x/\G'\ > x/2n, so by Proposition 15.71 with probability at 
least 2/3 the set U := X n Uq satisfies \U\ > ax/5 > a/3n/5. Also, for any vertex v E U, the 
expected number of outneighbours of v outside X is at least 

x an\ ( (1 + a)x\ an x (1 + a)xy (1 + a)x 2 

V + o ~ ~r 1 r^Ti — ~ -r + n 



2 4/V / 4 2 2(1-7)77 4(l- 7 )n 

an 2xn — 2xy — x 2 — 27x77 — 2axy — ax 2 
>iy 1 

~ y 4 4(l-7)n 

X 2 /3 2 77 

>y + 2an > y H 2an > y + 2an, 

4n 4 

where in the first inequality of the third line we used the fact that 2n — 2y — x > x. A similar 
calculation shows that for each v £ U , the expected number of inneighbours of v outside X 
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is at least z + 2an. So by Proposition 15.71 we find that with probability at least 2/3, every 
vertex v £ U has at least y + an outneighbours outside X and at least z + an inneighbours 
outside X. Fix a choice of X such that both these events of probability at least 2/3 occur. 

Since every vertex of U has either at least — |X|)/2 > y+z+an inneighbours outside 
X or at least y + z + an outneighbours outside X, we may choose a set U' C U of size 
l^'l > \U\/2 > af3n/W such that either 

(ai) every v £ U' has at least ?/ + an outneighbours outside X and at least y + z + an 

inneighbours outside X, or 
(«2) every v £ U' has at least y + z + an outneighbours outside X and at least z + an 

inneighbours outside X. 

So is a (2e//3)-regular (<i/2)-dense cycle of cluster tournaments on clusters X±, . . . , X^ 

of size (1 + a)x/k, and U' Q X% U • • • U X^ has size \U'\ > af3x/10. Also T ex ^ is a di- 
rected tree on x vertices rooted at t\ and with A(T ex t) < A*, and H C y(T ex t) has 
< n/A fcl//35 < fe/7fc and \{t £ T e:r i : 1 < d(t,V k (H)) < k 3 }\ < 5f3n < Sx. So by 
Lemma 15.51 (with a/3/10, A* and d/2 in place of A, A and d respectively), contains a 

copy of T ex t in which every vertex of H is embedded to a vertex of U' . 

So every vertex t £ H has been embedded to some vertex v{t) G {/'. Suppose that (ai) 
holds. Then for every t £ H, v(t) has at least y + 2n/A outneighbours outside X (and so 
unoccupied by vertices of T ex t). Since the only vertices of T ex t which may have neighbours 
in Ti — T ex t are the vertices of H, we may use Theorem 11.31 to extend the embedding of 
T ex t in G[X] to an embedding of T\ in G in the same way as in the proof of Lemma 12.61 
(we cannot just apply Lemma 12.61 as vertices of G to which we embedded T ex t — H may 
not have sufficiently many outneighbours, but since vertices of T ex t — H do not have any 
outneighbours outside T ext this does not cause any problems). Then for every t £ H, v{t) 
has at least z + 2n/A inneighbours outside X which are not occupied by this embedding 
of T\. So in the same way we may extend the embedding of T ex t in G[X] to an embedding 
of Ti in the vertices of G not occupied by T\ — T ex t- So the embeddings of T\ and T2 only 
overlap in T ex t , and so together form an embedding of T in G. If instead (02) holds we may 
embed T in G similarly by first embedding T2 then T\. □ 

We can now deduce that if G is a large almost-regular tournament and if |Ta| > 1, then 
Sumner's conjecture holds with a little room to spare (we shall need this extra room in the 
proof of Lemmas 16. 21 and I6.3p . Indeed, we shall see that a large almost-regular tournament G 
is also a robust outexpander, and so if Ta is large, then we can embed T in G by Lemma [5 .31 
On the other hand, if 7a is small but has more than one vertex, then we may embed T in 
G by Lemma 14.61 

In particular, together with Lemma 13. II (which deals with the case |TX| = 1), this means 
that at this stage, we have proved that Sumner's conjecture holds for all large almost-regular 
tournaments. 

Lemma 5.8. Suppose that l/n<C7<C 1/A ^ 1. Let T be a directed tree on n vertices with 
|7a| > 1. Then every ^-almost-regular tournament G on at least (2 — 7)77. vertices contains 
a copy ofT. 

Proof. Introduce constants /U, is, rj, A', /?, 7' such that 

1/n < 1/A' <C / Li«^«r?<7</3<7 / < 1/A < 1. 
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Let G be a 7-almost-regular tournament on at least (2—7)71 vertices. Then we shall show that 
G is a robust (/x, ^)-outexpander. Indeed, let S C V(G) satisfy v\G\ < \S\ < 2|G|/3. Then at 
least (1 — 7)|5|(|G| — l)/2 edges originate in S. At most ('f ) of these have both endvertices 

in 5, so at least (1 - 7)|5|(|G| - l)/2 - ('f') > |5|((1 - 7)(|G| - 1) - \S\)/2 > u\G\ 2 /10 
edges leave S. So at least u\G\/20 > 3/x|G| vertices outside S have at least v\G\/20 > 3fi\G\ 
inneighbours in S. At most 2/j|G| vertices of S have fewer than fi\G\ inneighbours in S, 
and so \RN+(S)\ > \S\ + n\G\, as desired. On the other hand, if S C V(G) satisfies 
2|G|/3 < |5| < (1 — ^)|G|, every vertex of G has at least \G\/7 > /x|G| inneighbours in S. 
So \RN+(S)\ = \G\ > \S\ + (i\G\, as desired. 

So G is indeed a robust (/j, i/)-outexpander. Clearly 5°(G) > r)\G\. So if |T^'| > f3n, then 
by Lemma 15.31 G contains a copy of T. So we may assume that |2~a'| < fin. But G is 
also a 7'-almost-regular tournament on at least (2 — ~f)n vertices, and so by Lemma |4.6|, G 
contains a copy of T. □ 



6. Embedding trees whose core tree is small 

We now turn our attention to the general case of the problem. As when considering 
almost-regular tournaments, we consider the problem of embedding directed trees whose 
core trees are small separately from the case when the core trees are large. In this section 
we shall consider directed trees with small core trees, proving the following lemma. 

Lemma 6.1. Suppose 1/n <C /3,1/A' <C 1. Let T be a directed tree on n vertices with 
I^A'I < Pn, and let G be a tournament on 2n — 2 vertices. Then G contains a copy ofT. 

We begin by showing that we may assume that the tournament G consists of two large 
disjoint almost-regular tournaments, with almost all of the edges between them directed the 
same way. 

Lemma 6.2. Suppose that 1/n /3,1/A <C 7 <S 77 <C 1. Let T be a directed tree on n 
vertices with |Ta| < f3n, and let G be a tournament on2n — 2 vertices. Let y be the outweight 
o/Ta, and let z be the inweight ofT&. Then the following properties hold. 

(i) If z < 7/n or y < nn then G contains a copy ofT. 

(ii) Either G contains a copy of T , or we can find disjoint sets Y,Z C V(G) such that 
\Y\ > (2 — 7)2/ and \Z\ > (2 — j)z, G\Y] and G[Z] are ^-almost-regular, any vertex 
of Y has at most 3jn outneighbours in Z and any vertex of Z has at most 3771 
inneighbours in Y. 

Proof. Introduce new constants M, M', e, e', a, 7* and A* such that 

1/n < p, 1/A < l/M < l/M' <e<e'<7<a<7/<7*< 1/A* < 1. 

Partition the vertex set of G into sets A,B,C,D,E such that: 

A C {v e G : d + (v) < y + en}, 

B C {v G G : y + en < d + (v) < n — en}, 

CC{«eG: d + {v),d~{v) >n-en}, 

D C {v G G : z + en < d~(v) < n — en}, 

EC{veG: d~(v) < z + en}. 
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These subset relations may not all be equality, for example in the case where z is very small, 
when we have y + en > n — en. However, it is clear that each vertex v G G lies in at least one 
of these five sets, so we may choose such a partition of V(G). Let x := |Ta|, so x + y + z = n 
and x < (3n. 

Suppose that \B\ > 3x. Then by Theorem II .31 we may embed Ta in G[B}. Let 5a Q B be 
the set of vertices occupied by this embedding of Ta. Then every vertex of 5a has at least 
y+en — x > y + 2re/ A outneighbours outside 5a and at least | G\ — x — (re — en) > y + z+2n/ A 
inneighbours outside 5a • Let T\ be the subtree of T formed by Ta and all outcomponents 
of Ta, and let Ti be the subtree of T formed by Ta and all incomponents of Ta- Then 
|Ti| = x + y and 1 2~2 1 = x + z. By Proposition I2.1f iv). all incomponents and outcomponents 
of Ta contain at most re/ A vertices, so by Lemma 12.6( c) we may extend our embedding 
of Ta in 5a to an embedding of T\ in G. Then each vertex of 5a still has at least z + 2n/A 
inneighbours outside 5a which are not occupied by this embedding of T%, so by Lemma l2T6lf c) 
we may also extend our embedding of Ta in 5a to an embedding of T<i in G which avoids 
vertices occupied by the embedding of T\ — Ta- Then these embeddings of T\ and T2 do 
not overlap outside Ta, and so together form an embedding of T in G. We may therefore 
assume that \B\ < 3x < 3j3n. By the same argument (embedding first T2 and then T\ in G) 
we may assume that \D\ < 3x < 3/3re. 

If |Ta* I = 1, then G contains a copy of T by Lemma f3.ll So we may assume that |Ta* | > 2. 
Now, if z < nn, then every v G E satisfies d~(v) < (rj + e)n < 2r]n, so \E\ < Ann + 1, and so 
\BUDL)E\ < Ann + l + Qfin < 5nn. Let G' := G[AuC). Then \G'\ > 2n-2-5t]n, and every 
vertex v G G' has d + (v) < n + ere. So by Proposition 12.41 G' contains a 7*-almost-regular 
subtournament G' on at least (2 — j*)n vertices. Since |Ta*| > 2, by Lemma|52]G' contains 
a copy of T, so G contains a copy of T also. If instead we have y < rjn, then we may similarly 
embed T in G[C U E\. So if z < nn or y < rjn then G contains a copy of T, completing the 
proof of (i). So for (ii), we may assume that y,z > rjn. 

Suppose now that |C| > 5e'n. Let disjoint subsets Vi,...,Vk and a subgraph G* C G 
satisfy the conditions of Corollary 14.21 So M' < k < M, and G* is an e-regular cluster 
tournament on clusters V\, . . . , of equal size m, where 

( 1 -')i°i< ro <]gi. 

k k 

We shall show that G* has the property that for some i G [k] we have 

(l-3e')k , , (l-3e')A; 

(10) Yl and E d 3^~ 2—^- 

ie(W\W) je([fc]\{<}) 
Indeed, if for some i G [k] we have Ylje([k]\{i}) ^ij < (1 — 3e')k/2, then by Lemma FOl all but 
at most e'rre vertices of have at most 

E7 /t (1 — s')km 
dijm + e te < < n — 8en 

J6(W\«) 

outneighbours in Ujg([fc]\{«}) ( m ^ ne graph G*), and hence at most n — 8en + (\G\ — \G*\) + 
\Vi\ + 2e\G\ < re — en outneighbours in G. So at most e'm vertices of Vi lie in C. Similarly if 
for some i G [k] we have X^e([fc]\{«}) 4/« < (1 — 3e')k/2 then again at most e'm vertices of Vi 
lie in C. Since |C| > 5e'n > 2e'mk + (|G| — there must be some i G [k] which satisfies 

(110p . Fix such an i. Then if at least ak values of j G [k] \ {i} have dij > a and dji > a then 
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G* contains a copy of T by Lemma 14.41 (applied with e' in the place of 7). Alternatively, if 
at most ak values of j £ [k] \ {i} have dij > a and dji > a then since y,z > nn, G* contains 
a copy of T by Lemma l4.5f iii) (again applied with e' in the place of 7). So in either case G 
contains a copy of T, and so we may assume that |C| < 5e' n. 

So to prove (ii), observe that we must therefore have \B U C U D\ < 5e'n + 6/3n < 6e'n. 
Trivially \A\ <2y + 2en + 1 and \E\ < 2z + 2en + 1, and so we must have 

\A\>2n-2- 6e'n - 2z - 2en - 1 > 2y - 7e'n, and 
\E\ > 2n - 2 - 6e'n - 2y - 2en - 1 > 2z - 7e'n. 

So by Proposition 12. 4\ G[A] contains a 7-almost-regular subtournament on at least (2 — j)y 
vertices, and G[E] contains a 7-almost-regular subtournament on at least (2 — 7)2 vertices. 
Let Y and Z be the vertex sets of these subtournaments respectively. Then any vertex of 
Y has at least (1 — 2*y)y outneighbours in Y, and so has at most y + en — (1 — 2j)y < S^n 
outneighbours in Z. Similarly any vertex of Z has at least (1 — 27)2; inneighbours in Z, and 
so has at most 3jn inneighbours in Y. So Y and Z are as required for (ii). □ 

The next lemma builds on the previous lemma and will in turn be used in the proof of 
Lemma 16.41 

Lemma 6.3. Suppose that 1/n <C (3,1/ A' < a < 1/A <C 1. Let T be a directed tree on 
n vertices with |7a'| — fin. Let y and z be the outweight and inweight o/Ta' respectively. 
Suppose that forests F~ and F + are induced subgraphs of T which partition the vertices of 
T, such that \F + \ < y + 2an, \F~\ < z — an, and every edge of T between F~ and F + is 
directed from F~ to F + . Suppose also that either 

(i) no component of F + has order greater than y — an, or 

(ii) the largest component T\ of F + has |(2i)a| > 2. 

Then any tournament G on 2n — 2 vertices contains a copy ofT. 

Proof. Let G be a tournament on 2n — 2 vertices, and let T\ and T2 be the largest and 
second largest components of F + respectively. Introduce new constants 7 and rj with 

1/n < (3, 1/A' < 7 < a <C 1/A <C 77 < 1. 

Then by Lemma [6 .21 we may assume that y,z > nn. Also by Lemma [6.2l we may find subsets 
Y,Z C V(G) such that \Y\ > (2 - -y)y, \Z\ > (2 - 7)2, is 7-almost-regular, each 

vertex of Y has at most 37n outneighbours in Z, and each vertex of Z has at most 3771 
inneighbours in Y. Then \Y\ > 3|-F + |/2 + an > |-F + | + | X2 1 + an, and \Z\ > 2\F~\ + an, 
and so by Lemma 12.71 any embedding of T\ in G[Y] may be extended to an embedding of T 
in G. 

It therefore suffices to embed T\ in G[Y]. If \T\\ < y/2 then we may do this by The- 
orem [L3l If instead |Ti| > y/2 > nn/2 and we also have (i), then |Ti| < y — an. Since 
\Y\ > (2 — 7)2/ > 2|Ti| + an we may embed T\ in G[Y] by Theorem 11.2( 1). Finally, if 
I Tl| > nn/2 and we also have (ii), then |T| < |F + | < y + 2an and |(Tl)a| > 2. Since 
7 < 9a/n, G[Y] is a 9a/r7-almost-regular tournament on at least (2 — ^)y > (2 — 9a/r?)|Ti| 
vertices, and so we may embed Tl in G[y] by Lemma 15.81 So in any case we may embed Ti 
in Gr[Y], completing the proof. □ 
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Observe that as with Lemma [2 .71 a 'dual' form of Lemma l6.3l can be proved similarly. For 
this we instead require that |T + | < y — an and \F~\ < z + 2an, and also either that no 
component of F~ has order greater than z — an or that the largest component T\ of F~ 
has > 2. If these conditions are met then we may conclude that G contains a copy 

of T. As with Lemma 12.71 we shall sometimes implicitly refer to this 'dual' when referring 
to Lemma 16.31 

In the next lemma we show that Lemma 16.11 holds for any directed tree T whose core 
tree Ta is not a directed path in which most of the outweight and inweight of Ta lies 
at the endvertices of Ta- We say that a vertex t of a directed tree T is an outleaf if t 
has one inneighbour and no outneighbours, or an inleaf if t has one outneighbour and no 
inneighbours. 

Lemma 6.4. Suppose that 1/n <C /3, 1/A' <C 1/A < cr < 1. Let T be a directed tree on n 
vertices with |T/y| < fin, and let y and z be the outweight and inweight ofT&i respectively. 
Let G be a tournament on In — 2 vertices. Then either G contains a copy of T , or Ta is a 
directed path whose outleaf has outweight at least y — an and whose inleaf has inweight at 
least z — an. 

Proof. Introduce new constants a and 77 with 

1/n <C 0, 1/A' < a <C 1/A <C a < re < 1. 

Then by Lemma 16.21 we may assume that y,z > rjn. Also, if |Ta| = 1 then G contains a 
copy of T by Lemma [37T1 so we may assume that |Ta| > 2. 

Suppose that some vertex t G T has the property that w"(t) < z — an — 1, and also that 
every outcomponent of t contains at most w + (t) — 3an = \V + \ — 3an vertices. Then let 
the set V~ consist of t and every vertex in an incomponent of t, and let V + := V(T) \ V" . 
Then \V~\ < w~(t) + 1 < z — an, and every edge of T between V~ and V + is directed 
from V~ to V + . Also, each component of T[y + ] contains at most w + (t) — 3an vertices. 
Now, select a source vertex from the largest component of T[V + ], delete this vertex from 
V + , and add it to V~. Repeat this step until we have \V + \ < y + 2an and |V _ | < z — an. 
For these final V + and V~, let F + := T[V + ] and let F~ := T[V~]. Then F~ and F + are 
forests which partition the vertices of T, with \F + \ < y + 2an and \F~\ < z — an. Also, 
every edge of T between F~ and F + is directed from F~ to F + . Finally, since we always 
deleted a vertex from the largest component of T[V + ], no component of F + contains more 
than \F + \ — 3an <y — an vertices. So by Lemma l6.3( i) G contains a copy of T. So we may 
assume that 

there is no vertex t £ T such that w~ (t) < z — an — 1 and every outcomponent of 

t contains at most w + (t) — 3an vertices. In particular, this implies that for every (f) 

inleaf t of Ta, at least n/2A vertices ofT lie in incomponents of t. 

Indeed, if Ta contains some inleaf t such that fewer than n/2A < z — an — 1 vertices of T 
lie in incomponents of t, then by the definition of Ta at least n/2A — 1 vertices of T lie in 
outcomponents of t other than the outcomponent containing the remaining vertices of Ta- 
Moreover, the definition of Ta also implies that at least re/ A vertices of T lie in the one 
component of T — t containing Ta — t. Altogether this shows that every outcomponent of t 
contains at most w + (t) — re/2 A + 1 < w + (t) — 3an vertices, a contradiction. By the same 
argument with the roles of incomponents and outcomponents switched, we may assume that 
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there is no vertex t £ T such that w + (t) < y — an — 1 and every incomponent of t 
contains at most w~(t) — 3an vertices. It follows from this that for every outleaf t (ft) 
of Ta, at least n/2A vertices ofT lie in outcomponents of t. 

Claim. // Ta has at least two inleaves or at least two outleaves, then G contains a copy 
ofT. 

To prove the claim, suppose that Ta has two outleaves t and t' (the proof for inleaves is 
similar) . Then we shall form a set V + of size between n — z + an and y + 2an such that any 
edge of T between V + and V~ := V(G) \ V + is directed from V~ to V + . We may do this 
by repeatedly selecting a sink vertex of T, adding it to V + and removing it from T. Now, 
by (ff) at least n/2A vertices lie in outcomponents of t, and at least n/2A vertices lie in 
outcomponents of t' . Furthermore, if T' is an outcomponent of t, then any sink vertex in 
T' is a sink vertex in T, and the same is true if T' is instead an outcomponent of t'. So we 
may form V + and V~ as described above so that additionally V + contains at least re/2 A 
vertices from outcomponents of t and at least n/2A vertices from outcomponents of t'. Fix 
such a choice of V + and V~ , and let T + := T[V + ] and F~ := T[V~] be the induced forests. 
Then |T + | < y + 2cm and |T~| = re — |T + | < z — an, and every edge of T between F~ 
and F + is directed from F~ to T + . So if every component of T + contains at most y — an 
vertices, then G contains a copy of T by Lemma l6.3( i). We may therefore assume that the 
largest component T + of F + contains more than y — an > |T + | — re/4A vertices. Since F + 
includes at least n/2A vertices from outcomponents of t and at least n/2A vertices from 
outcomponents of t', it follows that T + contains at least n/4A vertices from outcomponents 
of t and at least n/4A vertices from outcomponents of t'. As a consequence T + must contain 
t and t' . Furthermore, we must have t,t' £ (T + )4A, and so |(T + )4a| > 2. So G contains a 
copy of T by Lemma 16.3( h). which proves the claim. 

We may therefore assume that Ta has at most one outleaf and at most one inleaf. So Ta 
is a path with one inleaf and one outleaf. Let ti,...,t x be the vertices of this path, labelled 
so that t\ is the inleaf of Ta (so t\ — > £2)1 t x is the outleaf of Ta (so t x -\ — > t x ), and for each 
i € [x — 1] there is an edge of Ta between ti and ti + \. 

Now suppose that the inweight of Ta is less than z — 2an. Let the set V~ consist of all 
vertices of T which lie in Ta or in incomponents of T\ . Then | V ~ \ < z — 2an + \ Ta | < z — an 
(since |T A | < |T A '| < /3n). Also, every edge of T between V~ and V + := V(T) \ V~ is 
directed from V~ to V + . Choose a source vertex of T[F + ], delete it from V + , and add 
it to V~, and repeat this step until we have \V~\ < z — an and |^ + | < y + 2an. For 
these final V~ and V + , let F + := T[V + ] and F~ := T[V~] be the induced forests. Then 
\F~\ < z — an, \F + \ < y + 2an, and every edge of T between F~ and F + is directed from 
F~ to F + . Also, every component of F + is contained within a component of T — Ta, and 
so has order at most nj A < y — an by Proposition 12.11 So G contains a copy of T by 
Lemma l6.3( i). We may therefore assume that the inweight of Ta is at least z — 2an, and 
by a similar argument we may also assume that the outweight of Ta is at least y — 2an. It 
follows that the outweight of T\ is at most n — (z — 2an) < y + 3an and that the inweight 
of Ta is at most n — {y — 2an) < z + 3cm. 

We now suppose that fewer than y — an vertices of T lie in outcomponents of t x . Let 
Ti be the subtree of T formed by Ta and all of its outcomponents. Initially let the set 
V+ := V(Ti), so |V + | < y + Aan, and every edge of T between V + and V~ := V(G) \ V + 
is directed from V~ to V + . Choose a sink vertex of T[V - ], delete it from V~ and add 
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it to V + , and repeat this step until we have |^ + | < y + 4cm and |V~| < z — 2an. Fix 
these final V + and V~ and let F~ := T[V _ ] and F + := T[F + ] be the induced forests. So 
|T + | < V + 4an, |T~| < z — 2cm, and every edge of T between F~ and F + is directed from 
T~ to F + . Also Ti C F + , so Ti is contained within a single component T + of F + . Since 
at least y — 2an vertices of T lie in outcomponents of Ta, at least an/2 vertices of T lie in 
outcomponents of Ta other than the outcomponents of t x . Moreover, since t x is an outleaf 
of Ta, by (ff) at least n/2A vertices lie in outcomponents of t x . So t x -i G (T + )2A and 
t x G (T + )2A, and so | (T + )2a| > 2. But since the outweight of Ta is at least y — 2an we have 
|T + | > |Ti| > y — 2an, and so T + must be the largest component of F + . So G contains a 
copy of T by Lemma 16.3( h) . 

So we may assume that at least y — an vertices of T lie in outcomponents of t x , as desired. 
If fewer than z — an vertices of T lie in incomponents of t\, then we may similarly embed 
T in G, so we may also assume that at least z — an vertices of T lie in incomponents of 
t\. So at most 3an vertices of T do not lie in incomponents of t\ or outcomponents of t x . 
It remains only to show that Ta is a directed path. So suppose for a contradiction that 
Ta is not a directed path. Then there is some i G [x — 1] such that ti <— ii+i. Choose the 
minimal such i (note i > 1 as t\ is an inleaf of Ta). Then ti has two inneighbours and no 
outneighbours in Ta- So at least two incomponents of ti contain at least n/A vertices, and 
so no incomponent of t{ contains more than w~{ti) — n/A < w~(ti) — 3an vertices. Also, at 
most 3an <y — an — 1 vertices of T lie in outcomponents of ti, contradicting (ft)- ^ 

We can now prove that Sumner's universal tournament conjecture holds for any large 
directed tree T whose core tree Ta contains precisely two vertices. 

Lemma 6.5. Suppose that 1/n <C 1/A' <C 1. Let T be a directed tree on n vertices with 
|Tv| = 2, and let G be a tournament on 2n — 2 vertices. Then G contains a copy ofT. 

Proof. Introduce new constants A, e, 7 and n with 



Then |Ta'| = 2 < f3n. Also, since A < A' we have Ta Q Ta'. If |Ta| = 1, then by Lemma f3.ll 
G contains a copy of T. So we may assume that Ta = Ta'. Let ti and t\ be the vertices of 
Ta, labelled so that ti — > t±. Let y be the outweight of Ta, and let z be the inweight of Ta, 
so y + z = n — 2. Then by Lemma 16.41 (with e in the place of a), we may assume that ti 
has inweight at least z — en, and also that t± has outweight at least y — en. Let Ti be the 
subtree of T consisting of all vertices which lie in Ta or in outcomponents of Ta , and let T2 
be the subtree of T consisting of all vertices which lie in Ta or in incomponents of Ta- So 
= U + 2 and \Ti\ = z + 2. By Lemma l6.2( i) we may assume that y,z > nn. 
As in the proof of Lemma 16.21 we partition the vertices of G into sets A, B, C, D and E, 
where: 



1/n < (3, 1/A' < 1/A <e<7<r/<l. 



A 



{veG 



d + (v) < y + en} 



D 



E 



B 



C 



{v G G 
{v G G 
{v G G 
{v G G 



y + en < d + (v) < n — en}, 
d + (v) , dT (v) > n — en} , 
z + en < dT (v) < n — en}, 
d~{v) < z + en}. 
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Since y,z > nn and e <C n this is indeed a partition. Suppose first that \B\ > 2. Then we 
may embed T\ in £?[£?]. Let 5a Q B be the set of vertices occupied by Ta. Then every 
vertex of Sa has at least y + en — 1 > y + 2n/A outneighbours outside Sa and at least 
|G| — 2 — (n — en) > y + z + 2n/A inneighbours outside 5a- So by Lemma l2"Ul c) we may 
extend the embedding of Ta in Sa to an embedding of T\ in G. This embedding of T\ 
occupies at most y vertices of G outside Sa, and so we may apply Lemma [2TBT c) again to 
extend the embedding of Ta in Sa to an embedding of T2 in G so that the embeddings of 
Ti and T2 do not overlap outside Ta- Then together the embeddings of T\ and T2 form an 
embedding of T in G. So we may assume that \B\ < 1. If \D\ > 2 we may embed T in G 
in the same way by embedding Ta in D and then extending this embedding to embeddings 
of first T2 and then T\ in G which do not overlap outside Ta . So we may also assume that 
\D\ < 1. 

Now suppose that |C| > 3. Then we may choose vertices i>2,i>i € C with V2 — > v\ 
and |iV + (ux) H -/V + («2)| > nn > nn/2 + 2n/A. Embed ti to t>i and £2 to ^2- Then since 
I N + (v\)\, I A^ + (w2)| > n — en > y + 2n/A, by Lemma 12.6( b) and (c) we may extend the 
embedding of Ta in {v\, V2} to an embedding of T\ in G so that at least nn/2 vertices of T\ 
are embedded in N + {v\) n N + (y2)- Then at most y + 2 — nn/2 vertices of N~{v\) U N~{v2) 
are occupied by this embedding, and so in each of N~(v\) and N~(v2) at least n — en — (y + 
2 — nn/2) > z + 2n/A vertices remain unoccupied. So by Lemma 12.6( a) and (c) we may 
extend the embedding of Ta in {yi, 1)2} to an embedding of T2 in G which does not overlap 
with the embedding of T\ outside T\. Then together these embeddings form an embedding 
of T in G. So we may assume that \C\ < 2, and hence that \A U E\ > 2n — 6. 

Claim. Either some vertex of A has at least y outneighbours in AD B U D or some vertex 
of E has at least z inneighbours in B U D U E. 

Indeed, suppose for a contradiction that both of these statements are false. Then certainly 
every vertex of A has fewer than y outneighbours in A and every vertex of E has fewer than z 
inneighbours in E. So \A\ < 2y — 1 and \E\ < 2z — l. Since y + z = n — 2 and |AUi?| > 2n — 6, 
we must have \A\ = 2y — 1 and \E\ = 2z — 1, and also \B\ = 1, \D\ = 1 and \C\ = 2. Then 
every vertex of A must have y — 1 outneighbours in A, and so no vertex of A can have an 
outneighbour in B or in D. Likewise, every vertex of E must have z — 1 inneighbours in E, 
and so no vertex of E can have an inneighbour in B or in D. But then if we let b be the 
vertex in B and d be the vertex in D we have d + (b) = d + (d) ± 3, contradicting the definition 
of B and D. So either some vertex of A has at least y outneigbours in A U B U D or some 
vertex of E has at least z inneighbours in B U D U E. This completes the proof of the claim. 

If some v £ A has at least y outneighbours in AUBUD, then we shall embed T\ in G[A] so 
that we may then embed the incomponents of t2 and t\ in the unoccupied vertices of E and 
A respectively. For this, note that \E\ < 2(z + en) + 1, so |^4| > 2n — 2z — 2en — 7 > 2y — 3en 
(and similarly we have \E\ >2z — 3en). Since every a £ A has at most y + en outneighbours 
in A, by Proposition 12.41 G[A] contains a 7-almost-regular subtournament on at least (2 — j)y 
vertices. Let Y be the vertex set of this subtournament. Now, 

I (A U B U D) \ Y\ < 2 + (2y + 2en + 1) - (2 - j)y < 2 7 y, 

so v must have at least (1 — 2"f)y outneighbours in Y. Also, since v G A we have 

(1 - 2j)y < \N + (v) n Y\ < y + en < (1 + 2 7 )y. 
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So at most 10jy vertices of N + (v)(~)Y have more than (1 — Sj)y outneighbours in N + (v)riY , 
and at most 10jy vertices of N + (v)riY have more than (1 — 37)y inneighbours in N + (v)riY . 
Since every vertex of Y has at least (1 — 2 r y)y inneighbours in Y and at least (1 — 2^)y 
outneighbours in Y, this means that at least \N + (v)DY\ — 20 r yy > 3n/A vertices of N + (v)riY 
have at least 7?/ > 6n/A outneighbours in Y \ N + (v) and at least 6n/A inneighbours in 
Y \ N + (v). Let T + be the tree formed by t\ and its outcomponents, so \T + \ < y + 1. 
Then every component of T + — t\ is a component of T — Ta and so has order at most 
n/A by Proposition ED So by Lemma [23] (applied with N := N + (v) n(AU BU D) and 
X := Y \ N + (v)), we may embed T + in G[A USUD] so that t\ is embedded to v and at 
most 4n/A vertices are embedded outside N + (v). 

Since v £ A we have < y + en, and so v has at least 

(11) \Y\ - 1 - (y + en) -in/ A > 7en 

inneighbours in Y which are not occupied by the embedding of T + . Let T* be the tree formed 
by all vertices of T which do not lie in outcomponents of t\ or incomponents of t<i- Then every 
edge incident to t\ in T* is directed towards t\. Also, \T*\ < n — (y — en) — (z — en) = 2en + 2, 
so certainly every component of T* — 1\ has order at most 2en + 1. Together with (jlip and 
Theorem 1 1 . 3 1 this shows that we may extend the embedding of t\ in {v} to an embedding of 
T* in {v} U (N~ (v) n Y) so that the embeddings of T + and T* only overlap in the vertex 
t\. Then in particular t2 is embedded to some vertex V2 £ Y. 

To complete the embedding, observe that every vertex of Y has at least (1 — 2'j)y out- 
neighbours in Y, and therefore at most 3*yy outneighbours outside Y. So V2 has at least 
\E\ — 3jy > z + 2n/A inneighbours in E, none of which have been occupied by the embed- 
dings of T + and T*. Let T~ be the subtree of T consisting of ti and all of its incomponents. 
Then |T~| < z + 1, and each component of T~ — ti is a component of T — and so has order 
at most n/A by Proposition 12.11 So by Lemma 12.61 (c) we may extend the embedding of t<i 
in {^2} to an embedding of T~ in {^2} U E. These embeddings together form an embedding 
of T in G. 

If instead some v G E has at least z inneighbours in B U D U E then we may similarly 
embed T in G by choosing Z to be the vertex set of a 7-almost-regular subtournament 
of G[E] on at least (2 — 7)2; vertices and embedding T~ in G[B U DU E], then embedding 
T* — t2 in the unoccupied vertices of Z, before finally embedding T + — t\ in GL4]. □ 

We can now give the proof of Lemma [6. 11 It was necessary to prove Lemma [6.51 separately 
from this as the method of proof does not hold for \T&\ = 2 (we cannot obtain the partition 
of V(G) into Y* and Z* in this case). 

Proof of Lemma 16.11 Introduce new constants 7, a, A and r/ with 

1/n < /3, 1/A' < 1/A <7<a<7/<l. 

Let y' be the outweight of T&i and let z' be the inweight of Ta'. Then by Lemma 16.21 we 
may assume that y', z' > r/n. Similarly let y and z be the outweight and inweight of Ta 
respectively. If |Ta| = 1, then G contains a copy of T by Lemma 13.11 If instead |Ta| = 2 
then G contains a copy of T by Lemma 16.51 So we may assume that t := |Ta| > 3, and by 
Lemma [6741 we may assume that Ta is a directed path. Let t\,...,ti be the vertices of Ta, 
labelled so that ti — > for each i £ [I — 1]. Then by Lemma 16.41 we may also assume that 
the inweight of t\ is at least z' — jn and that the outweight of ti is at least y' — "fn. This 
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implies that z > z' — 777, and y > y' — 777. Since y' + z' + |T/y| = y + z + |Ta| = n it follows 
that we must have 

(12) y = y' ± 2772 and z = z' ± 2771. 

Finally, by Lemma 16.21 we may assume that there are disjoint sets 7,2C V(G) such that: 

(a) \Y\ > (2 - 7)2/ and \Z\ > (2 - 7)2', 

(b) G[Y] and G[Z] are 7-almost-regular, and 

(c) any vertex of Y has at most 3771 outneighbours in Z and any vertex of Z has at 
most 3771. inneighbours in Y. 

Let X := V(G)\(YUZ), so \X\ < 2777. Let T* be the subtree of T formed by deleting from T 
all vertices in outcomponents of tg_ or incomponents of t\. So |T*| < n-(z'-jn) — (y' — jri) < 
3771. Let T + be the subtree of T formed by i£ and its outcomponents, and let T~ be the 
subtree of T formed by t± and its incomponents. So |T + | < y + 1 and |T~| < z + 1. Also, 
each component of T + — ^ and each component of T~ — t\ is a component of T — T\ and 
so has order at most 72/ A by Proposition 12.11 

Suppose that some vertex v G X has at least an inneighbours in Y and at least an 
outneighbours in Z. Since £ > 3, we may choose i with 1 < i < £. Embed ti to v. Let T a 
be the subtree of T* consisting of U and all of its outcomponents, and let T, be the subtree 
of T* consisting of U and all of its incomponents. Then |T a |,|T,| < \T*\ < 3777,. So by 
Lemma [2. 6 1 we may extend the embedding of ti in {v} to an embedding of T a in ZL){v}, and 
similarly we may extend the embedding of ti in {v} to an embedding of in Y U {v}. Then 
in particular t\ is embedded to some v\ G Y and tg is embedded to some V£ G Z. So tji has 
at least \Z\ — 3777, > z + 3777 + 2n/A inneighbours in Z, at most 3777 of which are occupied 
by the embedding of T a . Similarly vg has at least \Y\ — 3777 > y + 3777, + 2n/ A outneighbours 
in 1", at most 3777 of which are occupied by the embedding of T^. So by Lemma 12.61 we 
may extend the embedding of t\ in {v\} to an embedding of T~ in {t;i} U Z and also extend 
the embedding of ti in {vi\ to an embedding of T + in {vg} U Y so that these embeddings 
together form a copy of T in G. 

So we may assume that no vertex of X has at least an inneighbours in Y and at least 
an outneighbours in Z. Let X + C X consist of all vertices of X with fewer than an 
inneighbours in Y, and let X~ C X \ X + consist of all vertices of X \ X + with fewer than 
an outneighbours in Z. Let Y* := Y U X~ and let Z* := Z U X + , so y* and Z* partition 
the vertices of G. Then any vertex of Y* has at most an outneighbours in Z, and thus at 
least z + an inneighbours in Z* (by (a), (|12p and the fact that z' > nn). Similarly any vertex 
of Z* has at most an inneighbours in Y, and therefore at least y + an outneighbours in Y* . 
Let W C V(G) consist of all vertices in Y* with at least y + an outneighbours in Y* and 
all vertices in Z* with at least z + an inneighbours in Z* . 

Now suppose that \W\ > |Ta|. Since Ta is a directed path, by Theorem 11.51 we may 
embed Ta in G[W]. Let C W be the set of vertices occupied by this embedding. Then 
|<Sa| = |Ta| < |Tv| < fin. So every vertex of 5a has at least y + an/2 > y + 2n/A 
outneighbours in Y* \ and at least z + an/2 > z + 2n/A inneighbours in Z* \ Sa- Let T\ 
be the subtree of T consisting of Ta and all of its outcomponents, and let T2 be the subtree 
of T consisting of Ta and all of its incomponents. So |Ti| = I + y and \T^\ = t + z. Also, 
each component of T\ — Ta and each component of T2 — Ta is a component of T — Ta , and so 
has order at most 77/ A by Proposition [2J] So by Lemma [2.61 we may extend the embedding 
of Ta in Sa to an embedding of T\ in Y* U Sa- Similarly by Lemma 12.61 we may extend the 



A PROOF OF SUMNER'S UNIVERSAL TOURNAMENT CONJECTURE FOR LARGE TOURNAMENTS 33 



embedding of Ta in 5a to an embedding of T2 in Z* U Sa- These embeddings of T\ and T2 
do not overlap outside Ta, and so together form an embedding of T in G. 

We may therefore assume that \W\ < |?a|, and hence that \G — W\ > 2n — 1 — I. Since 
y+z = n-£, we must have either |Y*\W| > 2y or |Z*\W| > 2z. Suppose that |Y*\W| > 2y. 
Then Y* \W contains a vertex vg with at least y outneighbours in Y*. So we may choose a 
set N C N+(v e ) n Y* with \N\ = y. Then \NDY\>y- (|Y*| - \Y\) > y - 2jn. Now, by 
(a), (b) and (fT2j) every vertex of Y has at least (1 — 2^y)y inneighbours in Y and at least 
(1 — 2y/~j)y outneighbours in Y. Since |JV| = y, at most 6^/7?/ vertices of N PI Y have more 
than (1 — 3^/7)2/ inneighbours in N DY, and at most 6y^yy vertices of N n Y" have more 
than (1 — 3 y / 7 y)y outneighbours in iV n Y. So at least | N n Y"| — 12^/7n > 3n/A vertices of 
iV have at least 6n/A inneighbours in Y* \ (N U {vg}) and at least 6n/A outneighbours in 
Y* \ (N U {t^}). This means that by Lemma [231 (applied with Y* \ (N U {w^}) playing the 
role of X) we may embed T + in Y* with ^ embedded to V£, and at most 4n/A vertices of 
T + embedded outside N. Since vg £ W, vg has at most y + an outneighbours in Y*, and so 
vg has at least \Y\ — 1 — (y + an) — 4n/A > 97n inneighbours in y which are not occupied 
by the embedding of T + . Since \T*\ < 3jn, by Lemma 12.61 we may extend the embedding 
of tg in vg to an embedding of T* in Y which only overlaps the embedding of T + in tg. The 
vertex t\ of T will therefore be embedded to some vertex v% £ Y. By (3), v\ then has at 
least \Z\ — 3771 > z + 2n/A inneighbours in Z, none of which will have been occupied by the 
embeddings of T* and T + so far. So by Lemma 12.61 we may extend the embedding of ti in 
{17} to an embedding of T~ in Z U {i>i}. Then the embeddings of T + , T~ and T* combine 
to form an embedding of T in G. If instead we have \Z* \ W\ > 2z, then we may embed T 
in G similarly, first embedding T~ in Z*, then embedding T* in the unoccupied vertices of 
Z, and finally embedding T + in Y. So in either case G contains a copy of T, completing the 
proof. □ 



7. Proof of Theorem 11.11 

Having proved that Sumner's conjecture holds for directed trees of small core, we now 
show that the same is true for directed trees of large core, which will complete the proof of 
Theorem ll.il We begin with an embedding result similar to Lemma 16.31 

Lemma 7.1. Suppose that 1/n « 1/A « /i « K< i) « 7 « a « ,8 « 1. Let T 

be a directed tree on n vertices, and let forests F~ and F + be induced subgraphs of T 
which partition the vertices of T such that \F + \ > 6an. Suppose also that every edge of T 
between F~ and F + is directed from F~ to F + . Let Y and Z be disjoint sets with \Y\ > 
2\F + \ — 2an and \Z\ > 2\F~\ +an, and let G be a tournament on vertex set YU Z such that 
every vertex ofY has at most j\G\ outneighbours in Z and every vertex of Z has at most 
*y\G\ inneighbours in Y. Finally, let T x + be the largest component of F + , and suppose that 
either 

(i) \lt\ < \F+\ - 3an, 

(ii) G[Y] is a robust (n,u)-outexpander with 5°(G[Y]) > n\Y\ and |(T 1 + )a| > fin, or 

(iii) A(T+) A. 

Then G contains a copy of T . 

Proof. First observe that if \G\ > 3n, then G contains a copy of T by Theorem 11.31 So we 
may assume that |G| < 3n, and hence that every vertex of Y has at most 37n outneighbours 
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in Z and every vertex of Z has at most 3jn inneighbours in Y. Let T 2 be the second largest 
component of F + . Then - |T 2 + | > \F+\/2 > 3cm, so \Y\ > \F+\ + |T 2 + | + an. Since 
\Z\ > 2|.F _ | + an, by Lemma 12,71 any embedding of T± in G[Y] may be extended to an 
embedding of T in G. So it is sufficient to embed in 

Note that |y| > 10cm, so if \T^\ < an, then G[Y] contains a copy of by Theorem 1 1.31 
Alternatively, suppose that \T^~ | > an. If (i) holds, then \T^~\ < \Y\/2 — 2an, and so 
\Y\ > (2 + a)|7^ |. So G[Y] contains a copy of by Theorem 11.21 1). If instead (ii) holds 
then G contains a copy of by Lemma 15.31 Finally, if (iii) holds then G contains a copy 
of by Theorem ll.2( ii) , completing the proof. □ 

Observe that as with Lemma 12.71 and Lemma 16.31 a 'dual' form of Lemma 17.11 can be 
proved similarly. For this we instead require that that \F"\ > 6cm, \Y\ > 2|i ?+ | + an and 
\Z\ > 2\F~\ — 2an, and also either that the largest component (T^)a of F~ contains at 
most \F~ \ — 3an vertices, or that G[Z] is a robust (u, i/)-outexpander with <5°(G[Z]) > rj\Z\ 
and KT] - )^! > /3n, or that A(T 1 _ ) < A. If these conditions are met we may conclude that G 
contains a copy of T. As with Lemma 12.71 we shall sometimes implicitly refer to this 'dual' 
when referring to Lemma 17.11 

The next lemma is our final result we need to proof Theorem 11.11 It states that if we can 
find disjoint subsets 7,ZC V{G) containing almost all of the vertices of G, so that G[Y] and 
G[Z] are robust outexpanders of large minimum semidegree with almost all edges between 
Y and Z directed the same way, then G contains a copy of T. 

Lemma 7.2. Suppose that 1/n < 1/A < fj, < v < n < 7 < a < /3 < 1. Let T be a 

directed tree on n vertices with \T&\ > (3n. Let Y and Z be disjoint sets with \Y U Z\ > 
(2 — a)n, and let G be a tournament on vertex set Y U Z such that 

(i) G[Y] is a robust (u,u)-outexpander with 5°(G[Y]) > n\Y\, 

(ii) G\Z\ is a robust (u,u)-outexpander with 5°(G[Z]) > rj\Z\, and 

(iii) every vertex ofY has at most "f\G\ outneighbours in Z, and every vertex of Z has at 
most 7|G| inneighbours in Y . 

Then G contains a copy of T . 

Proof. If \Y U Z\ > (2 + a)n, then G contains a copy of T by Theorem ll.2( i). So we may 
assume that \Y U Z\ = (2 ± a)n. Suppose first that \Z\ < QAan. Then \Y\ > (2 — 65a)n, 
and hence G[Y] contains a copy of T by (i) and Lemma 15.31 Similarly if \Y\ < 64an, then 
by (ii) and Lemma 15.31 G[Z] contains a copy of T. So we may assume that |y| > QAan and 



So we may form a forest F^~ of order between \Y \/2 + Aan and \Y \/2 + ban by repeatedly 
choosing a sink vertex of T, deleting it from T and adding it to Let F7 :=T — F^~, so 
that 



We therefore have |y| > 2)^1 — Wan and \Z\ > 21^"! +6an. Note also that \FJ~\ > 36an. 
Let T' be the largest component of F^~ . If |T'| < \F±~ \ — 18cm or \T' A \ > (3n/3 then G contains 
a copy of T by (i), (iii) and Lemma 17.11 So we may assume that \T'\ > - 18cm, and 
that \T' A \< pn/3. 



Z\ > Man. 



(13) 
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Next we form a forest -F 2 ~ which is a subgraph of T and which contains F7 . To do this, 
take -F 2 _ initially to be F[~ . Then select a source vertex of , delete it from and add it 
to -F 2 _ , and repeat this step until \Z\/2 + 4an < |-F 2 _ | < \Z\/2 + 5an, and let F% '■= T — F^ ■ 
Then by flT3]) we have \Ff n^| = \F^\ - |Ff| < 11cm. Also |F 2 + | < \Y\/2 - 3an, and so 
we have both \Z\ > 2|i ? 2 ~| — Wan and |Y| > 2|-F 2 + | + 6an. Observe also that |-F 2 ~| > 36an. 
Let T" be the largest component of -F 2 _ . Then if |T"| < |T 2 _ | — 18an then G contains a 
copy of T by (ii), (iii) and Lemma 17. 11 So we may assume that \T"\ > |-F 2 _ | — 18cm. Clearly 
|T' n T"| < |F+ n F 2 ~| < 11cm, and so \T' U T"\ > \T'\ + |T"| - \T n T"| > (1 - 47a)n. 
This implies that |T^| > /3n/3, as otherwise by Lemma 12.31 we would have |Ta| < /3n, a 
contradiction. Thus G contains a copy of T by (ii), (iii) and Lemma l7.lt as desired. □ 



Proof of Theorem 11.11 Introduce new constants with 

1/n < 1/A <^<i/<r/<7<a<a / </3<l. 

If |Ta| < /3n then G contains a copy of T by Lemma [6. 11 So we may assume that \T&\ > /3n. 
Let x := |Ta|, let y be the outweight of Ta, and let z be the inweight of Ta, so x + y + z = n. 
Also let Ti be the subtree of T formed by Ta and all outcomponents of Ta, and let T2 be 
the subtree of T formed by Ta and all incomponents of Ta, so |Ti| = x + y, and | T2 1 = x + z. 
By Lemma I5T21 we may choose disjoint subsets S%, . . . , S r of V(G) such that 

(i) IU eW ^|>(l-7)|G|, 

(ii) for each i G [r], any vertex v S Si has at most ~f\G\ inneighbours in Uj>i &j an d at 
most j\G\ outneighbours in Uj<i an d 

(iii) for each i £ [r], either is a robust (//, z/)-outexpander with 5 (G[Si\) > rj\G\ or 
N<7|G|- 

Let i be maximal such that \S\ U • • • U Sj_i| < max{2(z — an), 4an}, and let j be minimal 
such that \Sj+i U ■ • • U S r \ < max{2(y — an),4an}. Since y + z < n — fin, by (i) we have 
i < i (though equality is possible here). Let Z := S\ U • • • U Si, let Y" := Sj U • • • U S r and 
let X := Si+i U • • • U Sj-i. Then we have 

(14) \Z\Si\ < max{2(z — an), Aan} and \Y \Sj\ < max{2(y — an), Aan}. 
Also, by the maximality of i and the minimality of j we have 

(15) \Z\ > z + an and |Y| > y + an. 

Claim. If \Z\ Si\ > Wan or \Y \Sj\ > Wan then G contains a copy ofT. 

To prove the claim, suppose first that \Y \ Sj\ > llan. Let X~ := Z U X U Sj and 
X + := Y \ Sj. By (114h we have < 2y — 2an. Also, by (ii) every vertex in X~ has 

at most "f\G\ inneighbours in X + and every vertex in X + has at most r y\G\ outneighbours 
in X~ . Now, T\ — Ta is a forest on y > |A + |/2 + an vertices in which each component 
has order at most n/A by Proposition I2.1( iv). So by repeatedly deleting a source vertex 
of T\ — Ta, we may obtain a subforest F + on between |A + |/2 + 2an/3 and |A + |/2 + an 
vertices. So |T + | > 6an, and each component of F + has order at most n/A < |T + | — 3an. 
Let F + := T — F~ , so every edge of T between F~ and F + is directed from F~ to T + . 
Since \X+\ + \X~\> (1 ~7)|G| by (i), we have 

. ,_ . ... \X + \ 2an \X~\ an 

\F~\ =n — \F + \ < n — < . 

11 1 1 - 2 3~2 2 
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So \X~\ > 2\F~\ + an, and \X + \ > 2\F + \ — 2an, and so G contains a copy of T by 
Lemma [7. H i). If instead \Z \ Si\ > 11 an then G contains a copy of T similarly. This proves 
the claim. 

We may therefore assume that \Z \ Si\ < Wan and \Y \ Sj\ < Wan. Suppose first that 
i = j. Then \Si\ > (1 — j)\G\ — 22an > (2 — a')n, so by (iii) G[Si] is a robust (u,z^)- 
outexpander with 5°(G[Si]) > rj\G\ > r)\Si\. Thus G contains a copy of T by Lemma [531 
Now suppose instead that i ^ j, and also that \X\ < \2a'n. Then \Si U Sj\ > (1 — 7)|G| — 
\X\ - 22an > (2 - 13a')™- N o w if l^il < l\G\, then we must have \Sj\ > (2 - 14a')n. Then 
by (iii) GfSj] must be a robust (^, i/)-outexpander with 5°(G[Sj]) > n\G\ > r/|5j|, so G[Sj] 
contains a copy of T by Lemma 15.31 Alternatively, if \Sj\ < j\G\ then G[Si\ contains a copy 
of T similarly. Finally, if |jSi|, \Sj\ > "f\G\, then by (iii) G[S%] and G[Sj] must both be robust 
(/i, z^)-outexpanders with 5°(G[Si\) > rj\G\ > n\Si\ and <5°(G[5j]) > n\Sj\. Also, by (ii) every 
vertex of Si has at most 7|G| inneighbours in Sj, and every vertex of Sj has at most ~f\G\ 
outneighbours in Si. So G[Si U Sj] contains a copy of T by Lemma 1731 

So we may assume that i ^ j, and also that \X\ > 12a' n. We next consider two cases for 
the size of X, in each case showing that T may be embedded in G. 

Case 1: \X\ > {l + a)x. 

Since by Proposition I2.1( iii) we have A (2a) < A, by Theorem ll.2( ii) we may embed Ta 
in Let X' C X consist of the vertices occupied by this embedding. Now, by (ii) 

every vertex of X' has at most j\G\ inneighbours in Y, and hence by (|15|) at least y + an/ 2 
outneighbours in Y. Since by Proposition I2.1l fiv) every component of T\ — Ta has order at 
most n/A, by Lemma 12.61 we may extend the embedding of Ta in to an embedding 

of T\ in G[X'U7]. Similarly by (ii) every vertex of X' has at most j\G\ outneighbours in Z, 
and hence by (|15p at least z + an/2 inneighbours in Z. Since by Proposition I2.1( iv) every 
component of T2 — Ta has order at most n/A, by Lemma [2.6l we may extend the embedding 
of Ta in to an embedding of T2 in G[X' U Z]. Since these embeddings of T\ and T2 

only overlap in Ta, they together form an embedding of T in G. 

Case 2: \X\ < (1 + a)x. 

Observe that if \Z\ < 2z + an and \Y\ < 2y + an, then by (i) and the fact that x = |Ta| > 
f3n we have 

1^1 > (1 - l)\G\ - \Z\ - \Y\ > 2n - 2z - 2y - 3an > 2x - 3an > (1 + a)x, 

contradicting our assumption on X. So at least one of \Z\ > 2z + an and \Y\ > 2y + an 
must hold. This gives us three further cases, which we consider separately. 

Case 2(a): \Z\ > 2z + an, \Y\ <2y + an. 

In this case it is sufficient to embed T2 in G[X U Z]. Indeed, by (ii) every vertex of XL) Z 
has at most j\G\ inneighbours in Y, and therefore by fjl5[> at least y + an/2 outneighbours 
in Y. Since by Proposition I2.1( iv) every component of T — T2 has order at most n/A, any 
embedding of T2 in G[X U Z] can be extended to an embedding of T in G by Lemma 12.61 

Now, if \X U Z\ > 2|T2| + 2an, then we may embed T2 in G[X U Z] by Theorem ll.2( i). 
So we may assume that \X U Z\ < 2|T2 1 + 2an. Also, by (i) we have 

\X U Z\ > (1 - j)\G\ - \Y\ >2n-2y- 2an = 2x + 2z- 2an = 2|T 2 | - 2an. 

So \X U Z\ = 2|T2| ± 2an. In particular, since | T2 1 > |Ta| > f3n, we have \X U Z\ > f3n. 
By repeatedly deleting a source vertex of Ta, we may form a forest F which is an induced 
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subgraph of Ta (consisting of the undeleted vertices of Ta) so that every edge between 
Ta — F and F is directed from Ta to F, and also so that 



\X\,2o^\ \X\ , 



Let F~ := To — F. Then 



ln . lm , Im . X 2a' \T 2 \ Z a'\T 2 \ 

\F~\ = T 2 - \F\ < T 2 - - — — < — — . 

2 3 ~ 2 2 

So |X| > 2\F\ - 2a'\T 2 \ and \Z\ > 2\F~\ + a'|T 2 |. Also, |F| > \X\/2 > 6a'|T 2 |, and since F 
is a subtree of Ta, by Proposition 12. K m) each component C of F has A(C) < A. Since 
by (ii) every vertex of X has at most r y\G\ < 27|A~ U Z\//3 outneighbours in Z and every 
vertex of Z has at most j\G\ < 27|X U Z\//3 inneighbours in X, G[X U Z\ contains a copy 
of T 2 by Lemma mi as required. 

Case 2(b): \Z\ < 2z + an, |Y| > 2y + an. 

In this case T may be embedded in G by the same method as in the previous case, with 
the roles of inneighbours and outneighbours switched. So we begin by embedding T\ in 
G[X U Y], and then use Lemma 12.61 to extend this embedding to an embedding of T in G. 

Case 2(c): \Z\ > 2z + an, \Y\ > 2y + an. 

In this case, we shall partition T into three forests as follows. Initially take F~ to be the 
forest formed by all incomponents of Ta, and F + to be the forest formed by all outcompo- 
nents of Ta- Then select a source vertex of Ta, delete it from Ta and add it to F~ . Repeat 
this step until 2|T~| + an < \Z\ < 2|T~| + 2an. Next, select a sink vertex of Ta, delete it 
from Ta and add it to F + . Repeat this step until 2|T + | + an < \Y\ < 2|T + | + 2an. Then 
let F consist of all vertices remaining in Ta- So F is a subgraph of Ta- Also, by (i) 

\F\=n- \F~\ - \F + \ < n - \Y\/2 - \Z\/2 + 2an < \X\/2 + 3an, 

so (since \X\ > a'n) \X\ > \F\ + an. We shall embed the components of T~, F and F + in 
turn amongst the vertices of Z, X and Y respectively. Indeed, the proof is similar to the 
proof of Lemma 12.71 but with three forests instead of two. 

Let Ci, . . . , C s be the components of F~, F and F + , ordered so that C\ is a component of 
F, and for each i G [s — 1], Cj+i has precisely one neighbour in C% U • • • U Q. We shall embed 
the C{ in turn, so that each component of F~ is embedded in G[Z], each component of F is 
embedded in G[A], and each component of F + is embedded in G[Y"]. We also require that 
after each Q is embedded, the embeddings of Ci, . . . , Cj together form an embedding in G of 
the subtree of T induced by the vertices of Ci, . . . , G{. So suppose that we have successfully 
embedded C\, . . . , Cj_i in this manner, and we now wish to extend this embedding to include 
Cj. Then if i > 2, there is precisely one edge of T between and C\ U • • • U Cj_i. Let 
t be the endvertex of this edge in C\ U • • • U Cj_i, and let v be the vertex to which t was 
embedded. If Cj is a component of F~ , then i > 2, the edge between i and Ci is directed 
towards i and v £ X L)Y . So we may let S consist of the inneighbours of v in Z. Then by 
(ii) we have |5| > \Z\ — 7|C|. Let 5" C S consist of the unoccupied vertices of S. Since at 
most \F~\ — \d\ vertices of S are occupied by the embeddings of Ci, . . . , Cj_i, 

\S'\ > \Z\ - j\G\ - \F~\ + \Ci\ > 2\G\ + an/2. 

So if | Cj | < an/2 then G[S'] contains a copy of T by Theorem 11.31 an d if |Cj| > an/2 
then G[S'] contains a copy of T by Theorem ll.2( i). Alternatively, if Cj is a component of 
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F + , then i > 2, the edge between t and Cj is directed towards Ci and v £ X U Z. So we 
may let S 1 consist of the outneighbours of w in Y, and let 5' C S consist of the unoccupied 
vertices of S. Then we may embed Cj in S' by the same argument as used when Cj is 
a component of i 7- . Finally, suppose that Cj is a component of F. Then if i > 2 and 
i € -F" 1 ", let 5 consist of the inneighbours of t> in X. If instead i > 2 and t € i 7- , let S 
consist of the outneighbours of v in X. If i = 1 then let S = X. Then by (ii) we have 
\S\ > \X\ — 7|G|. Again let S' C S consist of the unoccupied vertices of S. Then it suffices 
to embed Ci in £*[£"]. Since at most \F\ — \G\ vertices have been embedded in S, we have 
> \X\ - j\G\ - \F\ + \d\ > \d\ + an/2. Now, Q is a subtree of T A , so A(d) < A by 
Proposition I2.1lf iii) . So if \C{\ > an/ '4, then G[S"] contains a copy of Ci by Theorem [L2jii) . 
On the other hand, if |Cj| < an/4, then G[S'\ contains a copy of Ci by Theorem 11.31 So in 
any case we may embed Ci as desired, completing the proof. □ 
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